PRODUCTION OF PANCREATIC ISLET CELLS
AND DELIVERY OF INSULIN 



CROSS-REFERENCE TO RELATED APPLICATION 

This application is a continuation-in-part of U.S. Application Serial No.
09/535,145, filed March 24, 2000, and is also entitled to the benefit of U.S. Provisional
Application Serial No. 60/128,180, filed April 6, 1999, the disclosures of which are
incorporated herein by reference.

FIELD OF THE INVENTION 

The invention relates generally to the field of delivery of insulin to a subject by 
production of islet cells, particularly insulin producing beta cells. More particularly, the 
invention relates to production of islet cells by modulation of expression of nucleotide 
sequences encoding transcription factors involved in growth and differentiation of 
pancreatic islet cells. 

BACKGROUND OF THE INVENTION 

Diabetes mellitus is the third leading cause of death in the U.S. and the leading 
cause of blindness, renal failure, and amputation. Diabetes is also a major cause of 
premature heart attacks and stroke and accounts for 15% of U.S. health care costs. 
Approximately 5% of Americans, and as many as 20% of those over the age of 65, have 
diabetes. 

Diabetes results from the failure of the β-cells in the islets of Langerhans in the
endocrine pancreas to produce adequate insulin to meet metabolic needs. Diabetes is
categorized into two clinical forms: Type 1 diabetes (or insulin-dependent diabetes) and
Type 2 diabetes (or non-insulin-dependent diabetes). Type 1 diabetes is caused by the
loss of the insulin-producing β-cells. Type 2 diabetes is a more strongly genetic disease
than Type 1 (Zonana & Rimoin, 1976 N. Engl. J. Med. 295:603), usually has its onset
later in life, and accounts for approximately 90% of diabetes in the U.S. Affected
individuals usually have both a decrease in the capacity of the pancreas to produce
insulin and a defect in the ability to utilize the insulin (insulin resistance). Obesity
causes insulin resistance, and approximately 80% of individuals with Type 2 diabetes are
clinically obese (greater than 20% above ideal body weight). Unfortunately, about
one-half of the people in the U.S. affected by Type 2 diabetes are unaware that they have
the disease. Clinical symptoms associated with Type 2 diabetes may not become obvious
until late in the disease, and the early signs are often misdiagnosed, causing a delay in
treatment and increased complications. While the role of genetics in the etiology of Type
2 diabetes is clear, the precise genes involved are largely unknown.

Insulin is made exclusively by the β-cells in the islets of Langerhans in the
pancreas. During development, the islet cells, including the β-cells, develop from an
undifferentiated precursor within the growing pancreatic bud. As the bud grows, the 
undifferentiated cells form into ducts, and it is these cells that function as precursors. 
Duct cells appear to retain the capacity to differentiate into islet cells throughout life, and 
in some circumstances when the pancreas is damaged, new islet cells can form from the 
duct cells. Unfortunately, islet cell regeneration does not appear to occur when the islet 
cells alone are damaged, such as in type 1 diabetes. 

This developmental process is clinically relevant for several reasons. First, the 
formation of islet cells and especially β-cells is necessary in order to make insulin and
control energy metabolism. If the process of β-cell development is in any way impaired,
it predisposes that individual to the later development of diabetes. Therefore, genes
involved in this process are candidate genes for neonatal diabetes, maturity onset 
diabetes of the young (MODY) or type 2 diabetes. The sequence of these genes could be 
used to identify individuals at risk for the development of diabetes, or to develop new 
pharmacological agents to prevent and treat diabetes. 

Second, as discussed above, insulin production is impaired in individuals with 
diabetes. In type 1 diabetes the impairment is caused by the destruction of the β-cells,
while in type 2 diabetes, insulin production is intact, but inadequate. Treatment of type 1
diabetes, as well as many cases of type 2 diabetes, may involve replacement of the
β-cells. While replacement of β-cells may be accomplished in several ways, the
development of new β-cells from precursor cells, either in culture or in vivo in the
patient, would be the most physiologic. To do this, the molecules that control β-cell
differentiation are needed.
[08] For these reasons, the diabetes field has spent considerable effort in attempts to
identify islet precursor cells, and to develop methods for differentiating beta-cells in
vitro. To date this has been largely unsuccessful. The present invention addresses this
problem.

Literature 

[09] A cloned fragment of mouse Ngn3 is described in Sommer et al. 1996 Mol. Cell.
Neurosci. 8:221.

[10] cDNA and amino acid sequences of murine Ngn3 and murine mammalian atonal
homolog 4B (MATH4B) are described at GenBank Accession Nos. U76208 and
Y09167, respectively. The human ngn3 gene and mRNA are described at GenBank
Accession Nos. AJ133776 and NM_020999, respectively.

[11] cDNA and amino acid sequences of the rat relax transcriptional regulator are
described at GenBank Accession No. Y10619.

SUMMARY OF THE INVENTION

[12] The present invention relates to the production of islet cells and insulin in a
subject by providing for expression of an islet transcription factor in the pancreas of the
subject by, for example, introduction of nucleic acid encoding the transcription factor
neurogenin3 or a factor that induces neurogenin3 expression. The present invention also
relates to methods for using an islet transcription factor gene and the islet transcription
factor polypeptide to alter cellular differentiation in culture or in vivo to produce new
β-cells to treat patients with diabetes mellitus.

[13] A primary object of the invention is to provide for the production of islet cells in
the pancreas of a subject.

[14] Another object of the invention is to provide for the production of insulin in a
subject by inducing the formation of functional β-cells.

Another object of the invention is to provide a method for using the islet
transcription factor genes to alter cellular differentiation in culture or in vivo to produce
new β-cells to treat patients with diabetes mellitus.

Another object of the invention is to provide for production of islet cells 
for ex vivo therapy, e.g. production of islet transcription factor expressing cells for 
transplantation into a subject. 

These and other objects, advantages and features of the present invention will 
become apparent to those persons skilled in the art upon reading the details of the 
invention more fully set forth below. 

The invention will now be described in further detail. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a map of the Adeno-X.NGN3 viral construct, which contains the murine
neurogenin3 sequence operably linked to the CMV promoter.

FIG. 2A is a photograph showing histological staining for insulin in a pancreatic 
tissue section of an animal injected with Adeno-X.NGN3 at high magnification (40X) 
which indicates the production of insulin in accordance with the invention. The arrows 
indicate insulin staining cells. 

FIG. 2B is a photograph of a serial section of FIG. 2A, stained for the duct cell
marker cytokeratin20. The arrows indicate the position of insulin staining cells in the 
serial sections. 

FIG. 3 is a photograph of a confocal image with fluorescent co-staining for
insulin in red and neurogenin3 in green in the pancreas of an animal injected with
Adeno-X.NGN3. The production of neurogenin3 and insulin can be detected in the
same cell in accordance with the invention.

FIG. 4A is a photograph showing histological staining for insulin in a pancreatic
tissue section of an animal made diabetic with streptozotocin and injected with
Adeno-X.NGN3, indicating the production of insulin producing cells in accordance with the
present invention. Red arrows indicate individual and small clusters of insulin staining
cells. Black arrows indicate residual and degenerating cells remaining in formed islets
after streptozotocin treatment.

[24] FIG. 4B is a photograph showing staining for insulin in a pancreatic tissue
section of an animal injected with Adeno.LacZ, a control viral construct. Black arrows
indicate residual and degenerating cells remaining in formed islets after streptozotocin
treatment. Note individual and small clusters of insulin staining cells are seen.

[25] FIG. 5 is a graphical illustration of blood glucose levels of streptozotocin-treated
animals (diabetic) that received either Adeno-X.NGN3 or Adeno.LacZ (control animals).

[26] FIG. 6 is a photograph showing the expression of the islet hormone in transgenic
mice at embryonic day 12.5. Immunohistochemical staining is shown for glucagon in
pancreases of a non-transgenic littermate (A) and transgenic fetuses expressing ngn3 (B)
or neuroD1/BETA2 (C).

[27] FIG. 7 is a photograph showing the gut and associated organs at embryonic day
18.5 for fetal mice over-expressing (A) ngn3 or (B) neuroD1 under the control of the
pdx1 promoter. Non-transgenic littermates are shown at the right in each panel. Arrows
point towards the following organs: liver (L), stomach (St), spleen (sp), and pancreas
(P). Reduced pancreatic tissue is not visible in the transgenic animals.

[28] FIG. 8 is a map of the Adeno-X.NeuroD1 and Adeno-X.MASH1 viral constructs,
which contain the NeuroD1 and Mash1 sequences operably linked to the CMV promoter,
respectively.

[29] FIG. 9 is a map of the pBAT16.NGN3 plasmid DNA construct, which contains
the neurogenin3 sequence operably linked to the CMV promoter.

[30] FIG. 10 is a graphical representation of the phylogenetic tree of bHLH proteins.
All sequences are mouse except h.ngn3, which is human.

[31] FIGS. 11A-11B are schematics showing the human neurogenin 3 gene promoter.
FIG. 11A is a schematic representation of the promoter showing the relative positions of
the transcription start site and the 5' ends of promoter fragments used in the Examples.
Some potential sites for transcription factor binding are indicated, including a cluster of
sites in the distal promoter. The TATAA box (actual sequence GATAA) is shown 30 bp
upstream of the transcription start site. FIG. 11B shows the DNA sequence between
positions -3728 and -3653 of the promoter, corresponding to cluster 1. Again, potential
sites for transcription factor binding are indicated. Two sequences that could potentially
function as binding sites for homeodomain proteins of the Hox and related classes are
indicated by dashed lines.

FIG. 12 is a graph showing the function of the human neurogenin 3 promoter in 
cell lines. Promoter fragments containing sequences extending from the 5' end indicated 
to +261 bp were ligated upstream of the firefly luciferase gene and transfected into the 
cell lines shown. Reporter gene activity is expressed relative to the promoterless 
luciferase vector in the same cell type. Transfections were performed in triplicate on at 
least two occasions and errors are shown as +/- the standard error of the mean. ND, not 
done. 

FIGS. 13A-13C are photographs showing the function of the human neurogenin 
3 promoter in vivo. In FIG. 13A, 5.7 kb of the human Ngn3 promoter was ligated
upstream of the bacterial β-galactosidase gene and expressed in transgenic mice. The
staining indicates β-galactosidase catalysis of the X-gal substrate in the labeled cells.
FIGS. 13B and 13C show immunohistochemical staining (peroxidase detection) for
glucagon (brown) with enzymatic detection (blue) of β-galactosidase expression
in the pancreas of E15.5 mouse embryos. β-galactosidase expression is driven by
-5.7 kb or -2.6 kb of the human Ngn3 promoter as indicated. X-gal staining was
performed for 16 hours at 37°C (2.6 kb promoter) or at room temperature (5.7 kb
promoter). Both panels show comparable sections at 10x magnification.

FIGS. 14A-14B are photographs showing representative cells expressing the 
transgene. FIG. 14A shows a combination of fluorescent staining (for insulin (red) and
glucagon (green)) and enzymatic detection of β-galactosidase expressed under the
control of the 5.7 kb human neurogenin 3 promoter in an E15.5 mouse
pancreas. Co-expression of insulin and β-galactosidase in one cell is indicated by the
arrow. FIG. 14B shows co-detection of neurogenin3 (peroxidase staining) and
β-galactosidase (enzymatic detection) in an E15.5 mouse pancreas. The
arrows indicate cells that are expressing both neurogenin3 and β-galactosidase.



FIGS. 15A-15C are photographs showing gut expression of neurogenin 3. 
FIG. 15A shows that the activity of the β-galactosidase transgene driven by the 5.7 kb human
ngn3 promoter can be detected in scattered cells in the gut epithelium of an E15.5 mouse
pancreas. FIG. 15B is a control, showing the lack of similar β-galactosidase expression
in a wild-type littermate. FIG. 15C shows detection of Ngn3 (peroxidase detection) in
some cells also expressing β-galactosidase (filled arrows) and in some cells not
expressing β-galactosidase (white arrowhead) in an E15.5 transgenic mouse gut.

FIGS. 16A-16B are photographs of electromobility shift assays illustrating HNF3 
binding to the human ngn3 promoter. FIG. 16A shows an electromobility shift assay 
demonstrating the binding of transcription factor HNF3β to the H3-1 probe of the human
ngn3 promoter. FIG. 16B shows an electromobility shift assay demonstrating that a
proximal site and a distal site in the human Ngn3 promoter can compete equally for
binding of HNF3β.

FIG. 17 is a photograph of an electromobility shift assay showing that HNF1
binds to the human ngn3 promoter. 

FIGS. 18A-18C are graphs showing that HES1 inhibits the neurogenin3
promoter. FIG. 18A shows results with NIH3T3 cells transfected with the reporter
construct indicated, and co-transfected with the indicated amount of expression plasmid
containing the HES1 cDNA ligated downstream of the CMV promoter (vector pBAT12).
The luciferase activity is expressed relative to the activity in cells transfected with the
reporter plasmid alone. FIG. 18B shows the relative activity of various fragments of the
Ngn3 promoter and the Rous Sarcoma virus (RSV) promoter ligated upstream of
luciferase in the presence or absence of co-transfection with 90 ng/million cells of the
HES1 expression plasmid. FIG. 18C shows fold repression by cotransfected HES1 of
luciferase activity from the indicated promoters in NIH 3T3 cells and TC1.6 cells.
Fold repression is the inverse of relative luciferase activity. Transfections were 
performed in triplicate on at least 3 separate occasions. Errors are shown as +/- the 
standard error of the mean. 
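
The relationship stated for FIG. 18C, that fold repression is the inverse of relative luciferase activity, and the reporting of errors as the standard error of the mean over replicate transfections, can be illustrated with a minimal sketch. The function names and the replicate readings below are hypothetical and are chosen only for illustration; they are not data from the Examples.

```python
import math
import statistics


def relative_activity(with_hes1, reporter_alone):
    """Luciferase activity relative to cells transfected with the reporter alone."""
    if reporter_alone <= 0:
        raise ValueError("reference activity must be positive")
    return with_hes1 / reporter_alone


def fold_repression(rel_activity):
    """Fold repression is the inverse of relative luciferase activity."""
    if rel_activity <= 0:
        raise ValueError("relative activity must be positive")
    return 1.0 / rel_activity


def standard_error(values):
    """Standard error of the mean: sample standard deviation / sqrt(n)."""
    if len(values) < 2:
        raise ValueError("need at least two replicates")
    return statistics.stdev(values) / math.sqrt(len(values))


# Hypothetical triplicate readings (arbitrary light units), illustration only.
reporter_alone = 1000.0
with_hes1 = [250.0, 200.0, 150.0]

rel = [relative_activity(v, reporter_alone) for v in with_hes1]
folds = [fold_repression(r) for r in rel]
mean_fold = statistics.mean(folds)   # mean fold repression over replicates
sem_fold = standard_error(folds)     # reported as +/- SEM
```

A relative activity of 0.25 therefore corresponds to 4-fold repression; values below 1 indicate repression and values above 1 would indicate activation.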



FIG. 19A is a schematic of the DNA sequence immediately upstream of the
transcription start site (+1) of the human ngn3 gene. Potential HES1 binding
sites (N boxes) are indicated. 

FIGS. 19B-19C are photographs of electromobility shift assays showing that 
HES1 binds to the human ngn3 promoter. FIG. 19B shows an electromobility shift 
assay demonstrating that bacterially produced HES1 can bind to the N boxes in the 
proximal promoter, as well as to a previously characterized site from the mouse HES1
promoter (HIP). Either GST-HES1 or GST protein was incubated with the indicated
probes. The far right hand lane shows that the HES1 complex is supershifted by addition
of HES1 antiserum. FIG. 19C shows an electromobility shift assay demonstrating the
relative ability of HES1 binding sites to compete for binding of the GST-HES1 protein.
All three ngn3 promoter sites bind with higher affinity than the previously described
HES1 binding site from the HES1 promoter (HIP). In contrast, a 200-fold excess of
oligonucleotide E1 (an unrelated E box sequence from the proximal promoter) has no
effect on complex formation. 

DETAILED DESCRIPTION OF THE INVENTION 

Before the present compositions and methods for islet cell and insulin production 
are described, it is to be understood that this invention is not limited to the particular 
methodology, protocols, cell lines, vectors and reagents described, as such may, of
course, vary. It is also to be understood that the terminology used herein is for the
purpose of describing particular embodiments only, and is not intended to limit the scope
of the present invention, which will be limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular 
forms "a", "an", and "the" include plural referents unless the context clearly dictates 
otherwise. Thus, for example, reference to "an islet transcription factor" includes a 
plurality of such islet transcription factors and functional equivalents thereof, and 
reference to "the polynucleotide" includes reference to one or more polynucleotides and 
equivalents thereof known to those skilled in the art, and so forth. 



Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this
invention belongs. Although any methods, devices and materials similar or equivalent to 
those described herein can be used in the practice or testing of the invention, the 
preferred methods, devices and materials are now described. 

All publications mentioned herein are incorporated herein by reference for the 
purpose of describing and disclosing, for example, the cell lines, vectors, and 
methodologies which are described in the publications which might be used in 
connection with the presently described invention. The publications discussed herein are 
provided solely for their disclosure prior to the filing date of the present application. 
Nothing herein is to be construed as an admission that the inventors are not entitled to 
antedate such disclosure by virtue of prior invention. 

Definitions 

"Polynucleotide" as used herein refers to an oligonucleotide, nucleotide, and 
fragments or portions thereof, as well as to peptide nucleic acids (PNA), fragments, 
portions or antisense molecules thereof, and to DNA or RNA of genomic or synthetic 
origin which can be single- or double-stranded, and represent the sense or antisense 
strand. Where "polynucleotide" is used to refer to a specific polynucleotide sequence 
(e.g., an Ngn3 polypeptide-encoding polynucleotide), "polynucleotide" is meant to
encompass polynucleotides that encode a polypeptide that is functionally equivalent to
the recited polypeptide, e.g., polynucleotides that are degenerate variants, or
polynucleotides that encode biologically active variants or fragments of the recited
polypeptide, including polynucleotides having substantial sequence similarity or
sequence identity relative to the sequences provided herein. Similarly, "polypeptide" as
used herein refers to an oligopeptide, peptide, or protein. Where "polypeptide" is recited
herein to refer to an amino acid sequence of a naturally-occurring protein molecule,
"polypeptide" and like terms are not meant to limit the amino acid sequence to the
complete, native amino acid sequence associated with the recited protein molecule, but
instead are meant to also encompass biologically active variants or fragments, including
polypeptides having substantial sequence similarity or sequence identity relative to the
amino acid sequences provided herein. 

As used herein, "polypeptide" refers to an amino acid sequence of a recombinant
or nonrecombinant polypeptide having an amino acid sequence of i) a native
polypeptide, ii) a biologically active fragment of a polypeptide, iii) biologically active
polypeptide analogs of a polypeptide, or iv) a biologically active variant of a
polypeptide. Polypeptides useful in the invention can be obtained from any species, e.g.,
mammalian or non-mammalian (e.g., reptiles, amphibians, avian (e.g., chicken)),
particularly mammalian, including human, rodent (e.g., murine or rat), bovine, ovine,
porcine, murine, or equine, preferably rat or human, from any source whether natural, 
synthetic, semi-synthetic or recombinant. For example, "Human Ngn3 polypeptide" 
refers to the amino acid sequences of isolated human Ngn3 polypeptide obtained from a 
human, and is meant to include all naturally-occurring allelic variants, and is not meant 
to limit the amino acid sequence to the complete, native amino acid sequence associated 
with the recited protein molecule. 

As used herein, "antigenic amino acid sequence" means an amino acid sequence 
that, either alone or in association with a carrier molecule, can elicit an antibody 
response in a mammal. 

A "variant" of a polypeptide is defined as an amino acid sequence that is altered 
by one or more amino acids. The variant can have "conservative" changes, wherein a 
substituted amino acid has similar structural or chemical properties, e.g., replacement of 
leucine with isoleucine. More rarely, a variant can have "nonconservative" changes, 
e.g., replacement of a glycine with a tryptophan. Similar minor variations can also 
include amino acid deletions or insertions, or both. Guidance in determining which and 
how many amino acid residues may be substituted, inserted or deleted without 
abolishing biological or immunological activity can be found using computer programs 
well known in the art, for example, DNAStar software. 

By "nucleic acid of interest" is meant any nucleic acid (e.g., DNA) which 
encodes a protein or other molecule which is desirable for administration to a
mammalian subject. In general, the nucleic acid is operatively linked to other sequences
which are needed for its expression, such as a promoter. 

A "deletion" is defined as a change in either amino acid or nucleotide sequence in 
which one or more amino acid or nucleotide residues, respectively, are absent as 
compared to an amino acid sequence or nucleotide sequence of a naturally occurring 
polypeptide of interest. 

An "insertion" or "addition" is that change in an amino acid or nucleotide 
sequence which has resulted in the addition of one or more amino acid or nucleotide 
residues, respectively, as compared to an amino acid sequence or nucleotide sequence of 
a naturally occurring polypeptide. 

A "substitution" results from the replacement of one or more amino acids or 
nucleotides by different amino acids or nucleotides, respectively, as compared to an
amino acid sequence or nucleotide sequence of a naturally occurring polypeptide. 

The term "biologically active", for example, refers to human Ngn3 polypeptide 
having structural, regulatory, or biochemical functions of a naturally occurring Ngn3 
polypeptide. Likewise, "immunologically active" defines the capability of the natural, 
recombinant or synthetic human Ngn3 polypeptide, or any oligopeptide thereof, to 
induce a specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The term "derivative" as used herein refers to the chemical modification of a 
nucleic acid encoding a polypeptide or the encoded polypeptide. Illustrative of such 
modifications would be replacement of hydrogen by an alkyl, acyl, or amino group. A 
nucleic acid derivative would encode a polypeptide which retains essential biological 
characteristics of a natural polypeptide. 

As used herein the term "isolated" is meant to describe a compound of interest 
(e.g., either a polynucleotide or a polypeptide) that is in an environment different from 
that in which the compound naturally occurs. "Isolated" is meant to include compounds 
that are within samples that are substantially enriched for the compound of interest 
and/or in which the compound of interest is partially or substantially purified. 



[56] As used herein, the term "substantially purified" refers to a compound (e.g.,
either a polynucleotide or a polypeptide) that is removed from its natural environment
and is at least 60% free, preferably 75% free, and most preferably 90% free from other
components with which it is naturally associated.

[57] By "transformation" or "transfection" is meant a permanent or transient genetic
change, preferably a permanent genetic change, induced in a cell following incorporation
of new nucleic acid (e.g., DNA or RNA exogenous to the cell). Genetic change can be
accomplished either by incorporation of the new nucleic acid into the genome of the host
cell, or by transient or stable maintenance of the new DNA as an episomal element.

[58] By "transformed cell" is meant a cell into which (or into an ancestor of which)
has been introduced, by means of recombinant DNA techniques, a DNA molecule
encoding a protein of interest.

[59] By "construct" is meant a recombinant nucleic acid, generally recombinant DNA,
that has been generated for the purpose of the expression of a specific nucleotide
sequence(s), or is to be used in the construction of other recombinant nucleotide
sequences.

[60] By "vector" is meant any compound, biological or chemical, which facilitates
transformation of a target cell with a DNA of interest. Exemplary biological vectors
include viruses, particularly attenuated and/or replication-deficient viruses. Exemplary
chemical vectors include lipid complexes and naked DNA constructs.

[61] By "naked DNA" or "naked nucleic acid" or DNA sequence and the like is meant
a nucleic acid molecule that is not contained within a viral particle, bacterial cell or other
encapsulating means that facilitates delivery of nucleic acid into the cytoplasm of the
target cell. Naked nucleic acid can optionally be associated (e.g., formulated) with
means for facilitating delivery of the nucleic acid to the site of the target cell (e.g., means
that facilitate travel into the cell, protect the nucleic acid from nuclease degradation, and
the like) and/or to the surface of the target epithelial cell.

[62] By "promoter" is meant a minimal sequence sufficient to direct transcription.
"Promoter" is also meant to encompass those promoter elements sufficient for
promoter-dependent gene expression controllable for cell-type specific, tissue-specific or inducible
by external signals or agents; such elements may be located in the 5' or 3' regions of the
native gene.

By "operably linked" or "operatively linked" is meant that a DNA sequence and a 
regulatory sequence(s) are connected in such a way as to permit expression when the 
appropriate molecules (e.g., transcriptional activator proteins) are bound to the 
regulatory sequence(s). 

By "pancreas" is meant a large, elongated, racemose gland situated transversely 
behind the stomach, between the spleen and the duodenum. The pancreas is composed of 
an endocrine portion (the pars endocrina) and an exocrine portion (the pars exocrina). 
The pars endocrina, which contains the islets of Langerhans, produces and secretes 
proteins, including insulin, directly into the blood stream. The pars exocrina contains 
secretory units and produces and secretes a pancreatic juice, which contains enzymes 
essential to protein digestion, into the duodenum. 

By "euglycemia" or "euglycemic state" is meant a state associated with a level of 
blood glucose that is normal or nearly normal, particularly relative to the levels of blood 
glucose in a subject having a disease or condition associated with hyperglycemia. In 
humans, euglycemia correlates with blood glucose levels in the range of 70 mg/dl to 130 
mg/dl. 
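
The numerical range given in this definition can be expressed as a simple check. The following is a minimal sketch, assuming that both endpoints of the 70 mg/dl to 130 mg/dl range are inclusive (the definition does not say); the function names and labels are hypothetical and are used for illustration only.

```python
# Range taken from the definition above; treating both bounds as inclusive
# is an assumption made for this illustration.
EUGLYCEMIC_MIN_MG_DL = 70.0
EUGLYCEMIC_MAX_MG_DL = 130.0


def is_euglycemic(glucose_mg_dl):
    """Return True if a blood glucose value (mg/dl) lies in the euglycemic range."""
    return EUGLYCEMIC_MIN_MG_DL <= glucose_mg_dl <= EUGLYCEMIC_MAX_MG_DL


def classify_glucose(glucose_mg_dl):
    """Label a blood glucose value (mg/dl) relative to the euglycemic range."""
    if glucose_mg_dl < EUGLYCEMIC_MIN_MG_DL:
        return "below euglycemic range"
    if glucose_mg_dl > EUGLYCEMIC_MAX_MG_DL:
        return "above euglycemic range"
    return "euglycemic"


# Hypothetical readings, for illustration only.
readings = [60.0, 95.0, 250.0]
labels = [classify_glucose(r) for r in readings]
```

A reading above the range corresponds to the hyperglycemic state that the definition contrasts with euglycemia.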

By "precursor cell" is meant any cell which is capable of developing into an islet
cell such as fetal pancreatic epithelial cells, adult pancreatic cells (e.g., pancreatic duct
cells, acinar cells and pancreatic stem cells), gut epithelial cells, gut stem cells or crypt
cells, stem cells from other tissues (such as hematopoietic or hepatic stem cells), hepatic
cells (hepatocytes and hepatic duct cells), and embryonic stem cells in vivo or in vitro.

By "target cell" is meant any cell selected for incorporating DNA encoding a
transcription factor nucleotide sequence such as precursor cells, fetal pancreatic
epithelial cells, adult pancreatic cells (e.g., pancreatic duct cells, acinar cells and
pancreatic stem cells), gut epithelial cells, gut stem cells or crypt cells, stem cells from
other tissues (such as hematopoietic or hepatic stem cells), hepatic cells (hepatocytes and
hepatic duct cells), and embryonic stem cells in vivo or in vitro.

[68] By "islet transcription factor" is meant any transcription factor involved in the
differentiation and/or development of islet cells, the expression of which contributes to
the production of a cell having an islet cell phenotype, e.g., a cell that produces insulin or
other markers characteristic of islet cells, as well as functionally equivalent homologues.
Of particular interest are the class B basic helix-loop-helix (bHLH) transcription factors
involved in the development of islet cells, which include the neurogenins (neurogenin1,
neurogenin2 and neurogenin3), the neuroD factors (NeuroD1/BETA2, neuroD2, and
NeuroD4/Math3) and the Mash factor, Mash1, as well as functionally equivalent
homologues of these transcription factors.

[69] By "neurogenin3 (Ngn3) regulatory pathway" is meant a transcriptionally-related
group of proteins, including transcription factors, that regulate the development of
pancreatic islet cells and determine the phenotype of the islet cells. A positive regulator
of the Ngn3 pathway is one that promotes expression of Ngn3 or its downstream
effectors and thus positively affects the induction of the islet cell phenotype. A negative
regulator of the Ngn3 pathway is one that inhibits expression of Ngn3 or its downstream
effectors, and thus negatively affects the induction of the islet cell phenotype.
Positive regulators include factors that inhibit activity or expression of negative
regulators.

[70] By "islet cell" is meant a cell having a phenotype similar to the
hormone-producing cells normally comprising the pancreatic islets of Langerhans, and generally
characterized by the expression of markers that normally distinguish the cells in the
pancreatic islets of Langerhans from other pancreatic cells, such as insulin, glucagon,
somatostatin, pancreatic polypeptide, or islet amyloid polypeptide.

[71] By "β cell" is meant a pancreatic islet cell having a phenotype characterized by
the expression of markers that normally distinguish the β-cells from the other
pancreatic islet cells, such as insulin, Nkx6.1 or glucokinase.

[72] By "α cell" is meant a pancreatic islet cell having a phenotype characterized by
the expression of markers that normally distinguish the α-cells from the other pancreatic
islet cells, such as proglucagon or glucagon.

[73] By "target cell" is meant a cell in which, for example, introduction of a nucleic
acid of interest for expression is desired. The use of "target cell" throughout the
specification is for convenience only, and is not meant to imply that, for example,
accomplishing introduction of a nucleic acid of interest requires the use of targeting
techniques (e.g., targeting molecules that preferentially direct the material to be
introduced to a particular cell or cell type).

[74] By "subject" or "patient" is meant any mammalian subject for whom diagnosis or
therapy is desired, particularly humans. Other subjects may include cattle, dogs, cats,
guinea pigs, rabbits, rats, mice, horses, and so on. Of particular interest are subjects
having an insulin-associated disorder that is amenable to treatment (e.g., to mitigate
symptoms associated with the disorder) either by expression of an islet transcription
factor-encoding nucleic acid in a cell of the subject (e.g., by introduction of an islet
transcription factor-encoding nucleic acid into the subject in vivo), or by implanting
into the subject cells expressing an islet transcription factor (e.g., β-cell precursors)
or nearly developed or mature β-cells cultured from cells expressing an islet
transcription factor, which cells produce insulin.

[75] For sake of clarity, the following table provides a cross-reference of names of 

transcription factors as used in the present specification with names of the same factors 
as referred to in the literature as well as the official name of the corresponding human 
gene. 



Name as used herein    Other names in the literature    Official human gene name*
neurogenin1            ngn1; math4C; neuroD3            NEUROG1
neurogenin2            ngn2; math4A; ATOH4              NEUROG2
neurogenin3            ngn3; math4B; ATOH5; relax       NEUROG3
neuroD1                BETA2; BHF1; neuroD              NEUROD1
neuroD2                NDRF; rat4; KW8                  NEUROD2
neuroD4                math3; neuroM; ath3; ATOH3       NEUROD4
math2                  ath2; nex1; dlx3; atoh6          NEUROD6
mash1                  ASH1; HASH1                      ASCL1
mash2                  ASH2; HASH2                      ASCL2
math1                  HATH1                            ATOH1

*As accepted by the Human Gene Nomenclature Committee.
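
The cross-reference above can be treated as a simple lookup from any listed name to the official human gene name. The following is a minimal sketch under that reading; the names `ALIASES`, `build_index`, and `official_gene_name` are illustrative assumptions and not part of the specification, and the lookup is made case-insensitive purely for convenience.

```python
# Cross-reference of transcription factor names, transcribed from the table above.
# Keys are official human gene names; values are the other names listed for each.
ALIASES = {
    "NEUROG1": ["neurogenin1", "ngn1", "math4C", "neuroD3"],
    "NEUROG2": ["neurogenin2", "ngn2", "math4A", "ATOH4"],
    "NEUROG3": ["neurogenin3", "ngn3", "math4B", "ATOH5", "relax"],
    "NEUROD1": ["neuroD1", "BETA2", "BHF1", "neuroD"],
    "NEUROD2": ["neuroD2", "NDRF", "rat4", "KW8"],
    "NEUROD4": ["neuroD4", "math3", "neuroM", "ath3", "ATOH3"],
    "NEUROD6": ["math2", "ath2", "nex1", "dlx3", "atoh6"],
    "ASCL1": ["mash1", "ASH1", "HASH1"],
    "ASCL2": ["mash2", "ASH2", "HASH2"],
    "ATOH1": ["math1", "HATH1"],
}


def build_index(aliases):
    """Map every lower-cased name (official or alias) to its official gene name."""
    index = {}
    for official, names in aliases.items():
        for name in [official, *names]:
            key = name.lower()
            # Guard against one name pointing at two different genes.
            if key in index and index[key] != official:
                raise ValueError(f"ambiguous name: {name}")
            index[key] = official
    return index


INDEX = build_index(ALIASES)


def official_gene_name(name):
    """Return the official human gene name for a listed name, or None if unlisted."""
    return INDEX.get(name.strip().lower())
```

For instance, `official_gene_name("BETA2")` resolves to the NEUROD1 entry, while a name absent from the table yields `None`.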
Overview of the Invention

The present invention is based upon the discovery that the introduction of a
polynucleotide sequence encoding an islet transcription factor into a pancreatic cell or
other appropriate cell induces the production of cells having the phenotype of pancreatic
islet cells, including insulin-producing β-cells. This discovery in turn is based on the
discovery that providing for increased neurogenin3 (Ngn3) activity in a mature
pancreatic cell (a non-beta cell) provides for development of the non-beta pancreatic cell
into a cell with the pancreatic beta cell phenotype (e.g., production of insulin).

An increase in Ngn3 activity can be accomplished by, for example, introducing
an Ngn3-encoding polynucleotide into a cell to provide for Ngn3 expression (which may
be in addition to endogenous Ngn3 expression in the cell); providing for increased levels
of expression of a positive regulator of Ngn3 (e.g., by introducing a polynucleotide
encoding a transcription factor that positively regulates Ngn3 expression, or otherwise
increasing activity or expression of such Ngn3 positive regulators); inhibiting activity
(e.g., by inhibiting expression) of a negative regulator or inhibitor of Ngn3 expression or
activity; increasing expression of a downstream effector which is positively regulated
by Ngn3; and other variations that will be readily apparent to the ordinarily skilled
artisan upon reading the present specification. Modulation of transcription factor
activity (e.g., increasing Ngn3 activity or decreasing activity of an inhibitor of Ngn3
expression) can also be accomplished by use of signaling molecules (receptors, ligands,
intracellular effectors), as well as synthetic and natural small molecule regulators of the
pathway.

The invention generally involves providing for increased expression of at least
one islet transcription factor selected from the neurogenic basic helix-loop-helix factors
(bHLH) including the neurogenins (neurogenin1/NEUROG1/MATH4C/NeuroD3,
neurogenin2/NEUROG2/MATH4A or neurogenin3/NEUROG3/MATH4B), the neuroD
factors (NeuroD1/BETA2/BHF1, NeuroD2/NDRF, MATH2/NEX1/DLX3,
NeuroD4/Math3), the Mash factors (Mash1 and Mash2), and the atonal-related factors
(MATH1/ATOH1), as well as combinations thereof or combinations with other genes, to
provide for induction of pancreatic beta cells.



FIG. 10 is a schematic representation of an alignment tree for neuroendocrine
bHLH proteins, plus myoD (a myogenic class B bHLH protein) and E47 (a class A
(ubiquitous) bHLH protein). All are based on mouse amino acid sequences. The
sequences were aligned using the multiple sequence alignment algorithm Clustal-W as
supplied in the MacVector 6.5.1 sequence analysis program (Oxford Molecular). The
definition of class A and B is based on the classification of Murre et al. (Murre, et al.
(1989) Cell 58(3), 537-44).

Islet transcription factors are involved in the differentiation and development of
islet cells. Islet transcription factors include members of the class B basic
helix-loop-helix (bHLH) family of transcription factors, a family of factors known to
regulate growth and differentiation of numerous cell types. Islet cells and the developing
pancreas express a broad group of class B bHLH genes, among the most abundant being
Ngn3, NeuroD1/BETA2, Mash1 and NeuroD4/Math3. NeuroD1 is also known as
BETA2 and has been shown to be involved in the early differentiation of islet cells and
the regulation of insulin transcription in pancreatic beta cells. Neurogenin3 activates
NeuroD1/BETA2 during pancreatic development and therefore neurogenin3 lies
upstream of neuroD1 in the hierarchy of islet transcription factors activated during islet
cell differentiation. Ngn3 is expressed in islet cell progenitors and functions as a
pro-endocrine gene, driving islet cell differentiation. Ngn3 is expressed early on in the
development of all four islet cell types and is involved in the regulation of other islet
transcription factors such as Pax4 and Nkx2.2 as well as NeuroD1/BETA2. Early and
ectopic expression of Ngn3 can cause early and ectopic differentiation of islet cells.
Other islet transcription factors also include non-bHLH factors such as the homeodomain
factors, e.g., Nkx2.2 and Nkx6.1. These factors are immediately upstream or downstream
of ngn3 and are involved in islet cell development. The POU-homeodomain factor HNF1
and the winged-helix factor HNF3 lie upstream of ngn3, and along with the
cut-homeodomain factor HNF6 have been implicated in islet cell differentiation and are
further examples of islet transcription factors in accordance with the present invention.

While Ngn3 is referred to throughout the specification, such reference is not
intended to be limiting. Rather, Ngn3 is only exemplary of islet transcription factors
useful in the invention, and reference to it alone is for clarity and ease in review of the
specification.

Induction of Beta-Cell Development 

Pancreatic beta-cells can be produced from non-beta cell pancreatic cells by 
providing for production of an islet transcription factor in a pancreatic cell either in vivo 
(e.g., by administration of islet transcription factor-encoding nucleic acid (e.g., RNA or 
DNA) to the pancreas of a subject, e.g., by introduction of nucleic acid into a lumen of a 
pancreatic duct), or in vitro, e.g., by contacting a target cell (e.g., an isolated, non-beta, 
pancreatic cell) with islet transcription factor-encoding nucleic acid (e.g., RNA or DNA) 
in culture (which cells are then cultured, expanded, and transplanted into a subject). 

In one embodiment of particular interest, beta cells are produced by providing for
expression of neurogenin3 (Ngn3) at a level sufficient to induce the beta cell phenotype
in the target cell. Expression of Ngn3 in the target cell can be accomplished in a variety
of ways. For example, in one embodiment, Ngn3 expression is accomplished by
introduction of Ngn3-encoding nucleic acid (e.g., DNA or RNA) to provide for
expression of the encoded Ngn3 polypeptide in the target cell. In another embodiment,
Ngn3 expression is induced by introduction of a gene encoding a protein that provides
for induction of Ngn3 expression (e.g., expression of an "upstream" positive regulator of
Ngn3 expression in the target cell). In another embodiment, Ngn3 expression is
accomplished by introduction of a gene encoding a protein that inhibits activity (e.g.,
function or expression) of a negative regulator of Ngn3 expression. In another embodiment,
Ngn3 expression is induced by introduction of a small molecule that provides for
induction of Ngn3 expression (e.g., a small molecule pharmaceutical that induces Ngn3
expression in the target cell). In addition, production of pancreatic beta cells of the
invention can also be accomplished by providing for production of factors induced by
Ngn3.

As will be readily appreciated by the ordinarily skilled artisan upon reading the
present disclosure, Ngn3 expression can be accomplished by providing for any
combination of these approaches. For example, the invention also encompasses
providing for expression in the target cell of both an Ngn3-encoding nucleic acid as well
as a positive regulator of an endogenous Ngn3 gene; providing for expression of an
introduced Ngn3 nucleic acid as well as an inhibitor of a negative regulator of an
endogenous Ngn3 and introduced Ngn3 sequence; and the like. In general, any
combination of the approaches that provide for Ngn3 activity by, for example, providing
for expression of Ngn3 per se (by introduction of Ngn3-encoding nucleic acid or
providing for expression of endogenous Ngn3) and/or by providing for production of
factors "downstream" of Ngn3 that are normally produced as a result of Ngn3
expression, are within the scope of the present invention. Positive regulators of Ngn3
expression include, but are not necessarily limited to, HNF1, HNF3, and HNF6.

In general, factors that provide for production of Ngn3 activity in a target cell are
referred to herein as "islet transcription factors." As noted above, by "islet transcription
factor" is meant any transcription factor involved in the differentiation and/or development of
islet cells, the expression of which contributes to the production of a cell having an islet
cell phenotype, e.g., a cell that produces insulin or other markers characteristic of islet
cells, as well as functionally equivalent homologues. Of particular interest are the class
B basic helix-loop-helix (bHLH) transcription factors involved in the development of
islet cells, which include the neurogenins (neurogenin1, neurogenin2 and neurogenin3),
the neuroD factors (NeuroD1/BETA2, neuroD2, and NeuroD4/Math3) and the Mash
factor, Mash1, as well as functionally equivalent homologues of these transcription
factors. Reference to Ngn3 herein is for clarity, and is not meant to be limiting, but
rather to provide a reference point in a regulatory pathway that leads to pancreatic beta
cell development.

In addition, induction of the activity of the Ngn3 pathway can be accomplished 
using naturally occurring or synthetic molecules other than nucleic acid. For example, 
Ngn3 activity can be induced by using a synthetic molecule that promotes Ngn3 
expression, e.g., by inhibiting activity of a negative regulator of Ngn3 expression. 
Inhibitory transcription factors of Ngn3 expression include, but are not necessarily
limited to, HES1. Negative signaling pathways that inhibit Ngn3 expression include, but
are not necessarily limited to, the Notch pathway.
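
The rule that a factor inhibiting a negative regulator itself counts as a positive regulator amounts to multiplying signs along a regulatory path. The following is a minimal sketch of that bookkeeping, assuming each relationship is reduced to +1 (promotes) or -1 (inhibits). The HNF1, HNF3, HNF6, and HES1 edges follow the text; the "HES1 inhibitor" node and the names `SIGNED_EDGES`, `net_effect`, and `classify` are hypothetical and introduced only for illustration.

```python
from math import prod

# Signed regulatory edges: +1 means the regulator promotes its target,
# -1 means it inhibits its target.
SIGNED_EDGES = {
    ("HNF1", "Ngn3"): +1,
    ("HNF3", "Ngn3"): +1,
    ("HNF6", "Ngn3"): +1,
    ("HES1", "Ngn3"): -1,
    ("HES1 inhibitor", "HES1"): -1,  # hypothetical agent, for illustration only
}


def net_effect(path):
    """Multiply edge signs along a path such as ["HES1 inhibitor", "HES1", "Ngn3"]."""
    return prod(SIGNED_EDGES[(src, dst)] for src, dst in zip(path, path[1:]))


def classify(path):
    """Label the first factor in the path by its net effect on the last one."""
    return "positive regulator" if net_effect(path) > 0 else "negative regulator"
```

Under these assumptions, an inhibitor of HES1 has a net positive effect on Ngn3, since two inhibitory steps compose to a positive one.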



Islet Transcription Factor Nucleic Acids 

The term "islet transcription factor gene" is used to designate both transcription 
factors that are expressed in pancreatic islet cells, and also transcription factors that are 
involved in the development, differentiation, or formation of islet cells. The term "islet 
transcription factor gene" is also intended to mean the open reading frame encoding 
specific islet transcription factor polypeptides, introns, and adjacent 5' and 3' non-coding 
nucleotide sequences involved in the regulation of expression, up to about 10 kb beyond 
the coding region, but possibly further in either direction. The DNA sequences encoding 
an islet transcription factor may be cDNA or genomic DNA or a fragment thereof. The 
gene may be introduced into an appropriate vector for extrachromosomal maintenance or 
for integration into the host. 

The term "cDNA" as used herein is intended to include all nucleic acids that 
share the arrangement of sequence elements found in native mature mRNA species, 
where sequence elements are exons (e.g., sequences encoding open reading frames of the 
encoded polypeptide) and 3' and 5' non-coding regions. Normally mRNA species have 
contiguous exons, with the intervening introns removed by nuclear RNA splicing, to 
create a continuous open reading frame encoding the polypeptide of interest. 

An islet transcription factor genomic sequence of interest comprises the nucleic
acid present between the initiation codon and the stop codon, as defined in the listed 
sequences, including all of the introns that are normally present in a native chromosome. 
It may further include the 3' and 5' untranslated regions found in the mature mRNA. It 
may further include specific transcriptional and translational regulatory sequences, such 
as promoters, enhancers, etc., including about 10 kb, but possibly more, of flanking 
genomic DNA at either the 5' or 3' end of the transcribed region. The genomic DNA 
may be isolated as a large fragment of 100 kbp or more, or as a smaller fragment 
substantially free of flanking chromosomal sequence. 

The sequence of this 5' region, and further 5' upstream sequences and 3'
downstream sequences, may be utilized for promoter elements, including enhancer
binding sites, that provide for expression in tissues where the islet transcription factor is
expressed. The sequences of the islet transcription factor promoter elements of the
invention can be based on the nucleotide sequences of any species (e.g., mammalian or
non-mammalian (e.g., reptiles, amphibians, avian (e.g., chicken)), particularly
mammalian, including human, rodent (e.g., murine or rat), bovine, ovine, porcine,
or equine, preferably mouse or human) and can be isolated or produced from any
source whether natural, synthetic, semi-synthetic or recombinant.

The nucleic acid compositions used in the subject invention may encode all or a 
part, usually at least substantially all, of the islet transcription factor polypeptides as 
appropriate. Fragments of the DNA sequence may be obtained by chemically
synthesizing oligonucleotides in accordance with conventional methods, by restriction
enzyme digestion, by PCR amplification, etc. For the most part, DNA fragments will be
of at least about ten contiguous nucleotides, usually at least about 15 nt, more usually at
least about 18 nt to about 20 nt, more usually at least about 25 nt to about 50 nt. Such
small DNA fragments are useful as primers for PCR, hybridization screening, etc.
Larger DNA fragments, i.e., greater than 100 nt, are useful for production of the encoded
polypeptide. For use in amplification reactions, such as PCR, a pair of primers will be 
used. The exact composition of the primer sequences is not critical to the invention, but 
for most applications the primers will hybridize to the subject sequence under stringent 
conditions, as known in the art. It is preferable to choose a pair of primers that will 
generate an amplification product of at least about 50 nt, preferably at least about 100 nt. 
Algorithms for the selection of primer sequences are generally known, and are available 
in commercial software packages. Amplification primers hybridize to complementary 
strands of DNA, and will prime towards each other. 
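
The primer-pair criteria above (primers on complementary strands that prime towards each other and yield a product of at least about 50 nt) can be checked with a short script. The following is a minimal sketch, not a primer design tool: it assumes exact sequence matching as a stand-in for hybridization under stringent conditions, and the function names and default thresholds (15 nt primers, 50 nt product) are illustrative choices taken from the ranges stated in the text.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def amplicon_length(template, forward, reverse):
    """Length of the product primed by an exactly matching primer pair, or None.

    `template` is the top strand written 5' to 3'. The forward primer matches
    the top strand; the reverse primer matches the bottom strand, so its
    reverse complement is searched for downstream on the top strand.
    """
    top = template.upper()
    start = top.find(forward.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = top.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return end + len(reverse_site) - start


def is_acceptable_pair(template, forward, reverse,
                       min_primer_nt=15, min_product_nt=50):
    """Apply the primer length and minimum product size criteria."""
    if min(len(forward), len(reverse)) < min_primer_nt:
        return False
    length = amplicon_length(template, forward, reverse)
    return length is not None and length >= min_product_nt
```

A real workflow would also consider melting temperature, mismatches, and secondary structure, which commercial primer-selection software handles and this sketch deliberately omits.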

The islet transcription factor genes are isolated and obtained in substantial purity, 
generally as other than an intact mammalian chromosome. Usually, the DNA will be 
obtained substantially free of other nucleic acid sequences that do not include a sequence 
encoding an islet transcription factor or fragment thereof, generally being at least about 
50%, usually at least about 90% pure, and is typically "recombinant", i.e., flanked by
one or more nucleotides with which it is not normally associated on a naturally occurring 
chromosome. 



[93] The sequence of the islet transcription factor, including flanking promoter
regions and coding regions, may be mutated in various ways known in the art to generate
targeted changes in promoter strength, sequence of the encoded protein, etc. The DNA
sequence or product of such a mutation will be substantially similar to the sequences
provided herein, i.e., will differ by at least one nucleotide or amino acid, respectively, and
may differ by at least two, or by at least about ten or more nucleotides or amino acids. In
general, the sequence changes may be substitutions, insertions or deletions. Deletions
may further include larger changes, such as deletions of a domain or exon. It should be
noted that islet transcription factor sequences are conserved mainly within the bHLH
domain, and regions outside this domain may not be as well-conserved, and may even be
remarkably poorly conserved, between, for example, rat, mouse, and humans. Thus islet
transcription factors can tolerate more nucleotide and amino acid residue changes outside
of the bHLH domain and retain function to a much greater extent than changes made
within the bHLH domain. Such modified islet transcription factor sequences can be
used, for example, to generate vectors for introduction into target cells for the purpose of
producing islet cells.

[94] Techniques for in vitro mutagenesis of cloned genes are known. Examples of
protocols for scanning mutations may be found in Gustin et al., 1993 Biotechniques
14:22; Barany, 1985 Gene 37:111-23; Colicelli et al., 1985 Mol Gen Genet 199:537-9;
and Prentki et al., 1984 Gene 29:303-13. Methods for site specific mutagenesis can be
found in Sambrook et al., 1989 Molecular Cloning: A Laboratory Manual, CSH Press,
pp. 15.3-15.108; Weiner et al., 1993 Gene 126:35-41; Sayers et al., 1992 Biotechniques
13:592-6; Jones and Winistorfer, 1992 Biotechniques 12:528-30; Barton et al., 1990
Nucleic Acids Res 18:7349-55; Marotti and Tomich, 1989 Gene Anal Tech 6:67-70; and
Zhu 1989 Anal Biochem 177:120-4.

[95] An islet transcription factor of particular interest in the present invention is a
member of the neurogenin transcription factor family, e.g., neurogenin1 (Ngn1),
neurogenin2 (Ngn2), neurogenin3 (Ngn3), with Ngn3 being of particular interest. The
nucleotide and amino acid sequences of human Ngn3 are provided in the Sequence
Listing as SEQ ID NOS: 1 and 2, respectively. The nucleotide and amino acid
sequences of human Ngn1 are available at GenBank accession numbers XM_003834 and
NM_006161. The nucleotide and amino acid sequences of human Ngn2 are available at
GenBank accession number AF303002.
[96] It should be noted that transcription factors which act either "upstream" of Ngn3
(and therefore activate Ngn3 expression) or "downstream" of Ngn3, that lead to
development of the islet cell phenotype, are also contemplated for use in the present
invention.

[97] Neurogenin3, by itself, is sufficient to force undifferentiated pancreatic epithelial
cells to become islet cells. Since neurogenin3 expression determines which precursor
cells will differentiate into islet cells, the signals that regulate neurogenin3 expression
are also involved in islet cell formation. Although 2.7 kb of the ngn3 promoter is
sufficient to direct expression correctly in transgenic mice, distal sequences have been
shown to greatly enhance the expression of ngn3. This distal promoter region contains a
cluster of binding sites for pancreatic transcription factors such as HNF6, HNF1α, and
HNF3β. These pancreatic transcription factors have been found to regulate ngn3 gene
expression and thereby are also involved in the control of islet cell formation. These
signals may be useful in generating new islet cells for patients with diabetes mellitus.

[98] In another embodiment, the islet transcription factor is the human NeuroD1/BETA2
gene, which is available, with the corresponding human NeuroD1/BETA2 amino acid
sequence, at GenBank accession number NM_002500.

[99] The human Mash1 gene and the corresponding amino acid sequence are available
at GenBank accession number XM_006688.

[100] The human NeuroD4/Math3 gene and the corresponding human NeuroD4/Math3 

amino acid sequence are available at GenBank accession number AF203901. 

Constructs for Delivery of Islet Transcription Factor Nucleic Acid 
[101] Where the islet transcription factor nucleic acid to be delivered is DNA, any
construct having a promoter (e.g., a promoter that is functional in a eukaryotic cell)
operably linked to a DNA of interest can be used in the invention. The constructs
containing the DNA sequence (or the corresponding RNA sequence) which may be used
in accordance with the invention may be any eukaryotic expression construct containing
the DNA or the RNA sequence of interest. For example, a plasmid or viral construct
(e.g., adenovirus) can be cleaved to provide linear DNA having ligatable termini. These
termini are bound to exogenous DNA having complementary-like ligatable termini to
provide a biologically functional recombinant DNA molecule having an intact replicon
and a desired phenotypic property. Preferably the construct is capable of replication in
eukaryotic and/or prokaryotic hosts (viruses in eukaryotic, plasmids in prokaryotic),
which constructs are known in the art and are commercially available.

[102] The constructs can be prepared using techniques well known in the art.
Likewise, techniques for obtaining expression of exogenous DNA or RNA sequences in
a genetically altered host cell are known in the art (see, for example, Kormal et al., Proc.
Natl. Acad. Sci. USA, 84:2150-2154, 1987; Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2nd Ed., 1989, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, NY, each of which is hereby incorporated by reference with respect to methods
and compositions for eukaryotic expression of a DNA of interest).

[103] In one embodiment, the DNA construct contains a promoter to facilitate
expression of the DNA of interest within a pancreatic cell. The promoter may be a
strong, viral promoter that functions in eukaryotic cells such as a promoter from
cytomegalovirus (CMV), mouse mammary tumor virus (MMTV), Rous sarcoma virus
(RSV), or adenovirus. More specifically, exemplary promoters include the promoter
from the immediate early gene of human CMV (Boshart et al., Cell 41:521-530, 1985)
and the promoter from the long terminal repeat (LTR) of RSV (Gorman et al., Proc.
Natl. Acad. Sci. USA 79:6777-6781, 1982). Of these two promoters, the CMV promoter
is presently preferred as it provides for higher levels of expression than the RSV
promoter.

[104] Alternatively, the promoter used may be a strong general eukaryotic promoter
such as the actin gene promoter. In one embodiment, the promoter used may be a
tissue-specific promoter. For example, the promoter used in the construct may be a
pancreas-specific promoter, a duct cell-specific promoter or a stem cell-specific promoter. The
constructs of the invention may also include sequences in addition to promoters which
enhance expression in the target cells.

In another embodiment, the promoter is a regulated promoter, such as a 
tetracycline-regulated promoter, expression from which can be regulated by exposure to 
an exogenous substance (e.g., tetracycline).

The construct may also contain other components, such as a marker (e.g., an
antibiotic resistance gene (such as an ampicillin resistance gene) or β-galactosidase) to
aid in selection or identification of cells containing and/or expressing the construct, an
origin of replication for stable replication of the construct in a bacterial cell (preferably,
a high copy number origin of replication), a nuclear localization signal, or other elements
which facilitate production of the DNA construct, the protein encoded thereby, or both.

For eukaryotic expression, the construct should contain at a minimum a 
eukaryotic promoter operably linked to a DNA of interest, which is in turn operably 
linked to a polyadenylation signal sequence. The polyadenylation signal sequence may 
be selected from any of a variety of polyadenylation signal sequences known in the art. 
An exemplary polyadenylation signal sequence is the SV40 early polyadenylation signal 
sequence. The construct may also include one or more introns, where appropriate, which 
can increase levels of expression of the DNA of interest, particularly where the DNA of 
interest is a cDNA (e.g., contains no introns of the naturally-occurring sequence). Any 
of a variety of introns known in the art may be used (e.g., the human β-globin intron,
which is inserted in the construct at a position 5' to the DNA of interest). 
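
The minimum requirements for a eukaryotic expression construct described above can be expressed as an ordering check on the construct's elements, read 5' to 3'. The following is a minimal sketch assuming that "operably linked" is approximated by order alone (promoter, then DNA of interest, then polyadenylation signal, with any intron allowed in between). The `Element` type, the element kinds, and the function name are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    kind: str  # "promoter", "intron", "coding", or "polyA"
    name: str


def is_minimal_expression_cassette(elements):
    """True if a promoter precedes the DNA of interest, which precedes a polyA signal.

    Elements are listed 5' to 3'. Other elements (e.g., an intron placed 5' to
    the DNA of interest) may appear between the required ones.
    """
    kinds = [element.kind for element in elements]
    try:
        promoter_at = kinds.index("promoter")
        coding_at = kinds.index("coding", promoter_at + 1)
        kinds.index("polyA", coding_at + 1)
    except ValueError:
        # A required element is missing or appears out of order.
        return False
    return True


# Example construct assembled from elements named in the text.
example = [
    Element("promoter", "CMV immediate early"),
    Element("intron", "human beta-globin intron"),
    Element("coding", "Ngn3 cDNA"),
    Element("polyA", "SV40 early polyadenylation signal"),
]
```

Calling `is_minimal_expression_cassette(example)` accepts this layout, while a list lacking a polyadenylation signal, or placing it before the DNA of interest, is rejected.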

In an alternative embodiment, the nucleic acid delivered to the cell is an RNA 
encoding an islet transcription factor. In this embodiment, the RNA is adapted for 
expression (i.e., translation of the RNA) in a target cell. Methods for production of 
RNA (e.g., mRNA) encoding a protein of interest are well known in the art, and can be 
readily applied to the production of RNA encoding islet transcription factors useful in the
present invention. 






Delivery of Islet Transcription Factor-Encoding Nucleic Acid 
[109] Delivery of islet transcription factor-encoding nucleic acid can be accomplished
using a viral or a non-viral vector. In one embodiment the nucleic acid is delivered
within a viral particle, such as an adenovirus. In another embodiment, the nucleic acid is
delivered in a formulation comprising naked DNA admixed with an adjuvant such as
viral particles (e.g., adenovirus) or cationic lipids or liposomes. An "adjuvant" is a
substance that does not by itself produce the desired effect, but acts to enhance or
otherwise improve the action of the active compound. The precise vector and vector
formulation used will depend upon several factors, such as the size of the DNA to be
transferred, the delivery protocol to be used, and the like. Exemplary non-viral and viral
vectors are described in more detail below.
I. Viral vectors 



[110] In general, viral vectors used in accordance with the invention are composed of a
viral particle derived from a naturally-occurring virus which has been genetically altered
to render the virus replication-defective and to deliver a recombinant gene of interest for
expression in a target cell in accordance with the invention.

[111] Numerous viral vectors are well known in the art, including, for example,
retrovirus, adenovirus, adeno-associated virus (AAV), herpes simplex virus (HSV),
cytomegalovirus (CMV), vaccinia and poliovirus vectors. Adenovirus and AAV are
usually preferred viral vectors since these viruses efficiently infect slowly replicating
and/or terminally differentiated cells. The viral vector may be selected according to its
preferential infection of the cells targeted.

[112] Where a replication-deficient virus is used as the viral vector, the production of
infectious virus particles containing either DNA or RNA corresponding to the DNA of
interest can be achieved by introducing the viral construct into a recombinant cell line
which provides the missing components essential for viral replication. In one
embodiment, transformation of the recombinant cell line with the recombinant viral
vector will not result in production or substantial production of replication-competent
viruses, e.g., by homologous recombination of the viral sequences of the recombinant
cell line into the introduced viral vector. Methods for production of replication-deficient
viral particles containing a nucleic acid of interest are well known in the art and are
described in, for example, Rosenfeld et al., Science 252:431-434, 1991 and Rosenfeld et
al., Cell 68:143-155, 1992 (adenovirus); USPN 5,139,941 (adeno-associated virus);
USPN 4,861,719 (retrovirus); and USPN 5,356,806 (vaccinia virus). Methods and
materials for manipulation of the mumps virus genome, characterization of mumps virus
genes responsible for viral fusion and viral replication, and the structure and sequence of
the mumps viral genome are described in Tanabayashi et al., J. Virol. 67:2928-2931,
1993; Takeuchi et al., Archiv. Virol. 128:177-183, 1993; Tanabayashi et al., Virol.
187:801-804, 1992; Kawano et al., Virol. 179:857-861, 1990; Elango et al., J. Gen.
Virol. 69:2893-2900, 1988.
II. Non-viral vectors

The nucleic acid of interest may be introduced into a cell using a non-viral
vector. "Non-viral vector" as used herein is meant to include naked DNA (e.g., DNA not
contained within a viral particle, and free of carrier molecules such as lipids), chemical
formulations comprising naked nucleic acid (e.g., a formulation of DNA (and/or RNA)
and cationic compounds (e.g., dextran sulfate, cationic lipids)), and naked nucleic acid
mixed with an adjuvant such as a viral particle (e.g., the DNA of interest is not contained
within the viral particle, but the formulation is composed of both naked DNA and viral
particles (e.g., adenovirus particles)) (see, e.g., Curiel et al. 1992 Am. J. Respir. Cell Mol.
Biol. 6:247-52). Thus "non-viral vector" can include vectors composed of nucleic acid
plus viral particles where the viral particles do not contain the DNA of interest within the
viral genome.

In one embodiment, the formulation comprises viral particles which are mixed
with the naked DNA construct prior to administration. About 10^8 to about 10^10 viral
particles (preferably about 1 × 10^10 to about 5 × 10^10, more preferably about 3 × 10^10
particles) are mixed with the naked DNA construct (about 5 µg to 50 µg DNA, more
preferably about 8 µg to 25 µg DNA) in a total volume of about 100. Preferably the
viral particles are adenovirus particles (Curiel et al., 1992 supra).

Alternatively or in addition, the nucleic acid can be complexed with polycationic
substances such as poly-L-lysine or DEAE-dextran, targeting ligands, and/or DNA
binding proteins (e.g., histones). DNA- or RNA-liposome complex formulations
comprise a mixture of lipids which bind to genetic material (DNA or RNA) and facilitate
delivery of the nucleic acid into the cell. Liposomes which can be used in accordance
with the invention include DOPE (dioleoyl phosphatidylethanolamine) and CUDMEDA
(N-(5-cholestrum-3-β-ol 3-urethanyl)-N',N'-dimethylethylene diamine).

For example, the naked DNA can be administered in a solution containing
Lipofectin™ (LTI/BRL) at concentrations ranging from about 2.5% to 15%
volume:volume, preferably about 6% to 12% volume:volume. Preferred methods and
compositions for formulation of DNA for delivery according to the method of the 
invention are described in U.S. Pat. No. 5,527,928, the disclosure of which is 
incorporated herein by reference. 

The nucleic acid of interest can also be administered as a chemical formulation of 
DNA or RNA coupled to a carrier molecule (e.g., an antibody or a receptor ligand) 
which facilitates delivery to host cells for the purpose of altering the biological 
properties of the host cells. By the term "chemical formulations" is meant modifications 
of nucleic acids which allow coupling of the nucleic acid compounds to a carrier 
molecule such as a protein or lipid, or derivative thereof. Exemplary protein carrier 
molecules include antibodies specific to a targeted pancreatic cell or receptor
ligands, e.g., molecules capable of interacting with receptors associated with a
targeted pancreatic cell.

Production of Islet Transcription Factor Polypeptides and Antibodies That Specifically 
Bind Such Polypeptides 

Nucleic acid encoding Ngn3 or other islet transcription factors of interest may be 
employed to synthesize full-length polypeptides or fragments thereof, particularly 
fragments corresponding to functional domains; DNA binding sites; etc.; and including
fusions of the subject polypeptides to other proteins or parts thereof. Accordingly, the
polynucleotides and polypeptides suitable for use in the invention include, without
limitation, islet transcription factor polypeptides and polynucleotides found in primates,
rodents, canines, felines, equines, nematodes, yeast and the like, and the natural and
non-natural variants thereof. 

The islet transcription factor polypeptides can be used for the production of 
antibodies, where short fragments provide for antibodies specific for the particular 
polypeptide, and larger fragments or the entire protein allow for the production of 
antibodies over the surface of the polypeptide. Antibodies may be raised to the wild- 
type or variant forms of the polypeptide. Antibodies may be raised to isolated peptides 
corresponding to these domains, or to the native protein, e.g. by immunization with cells 
expressing the polypeptide of interest, immunization with liposomes having a 
polypeptide of interest inserted in the membrane, etc. 

Antibodies are prepared in accordance with conventional ways, where the 
expressed polypeptide or protein is used as an immunogen, by itself or conjugated to 
known immunogenic carriers, e.g. KLH, pre-S HBsAg, other viral or eukaryotic 
proteins, or the like. For further description, see Monoclonal Antibodies: A Laboratory 
Manual, Harlow and Lane eds., Cold Spring Harbor Laboratories, Cold Spring Harbor, 
New York, 1988. 

Antibodies that specifically bind islet transcription factors can be utilized to 
detect cells expressing a recombinant islet transcription factor such as Ngn3 (e.g., prior 
to transplantation). 

Production of Islet Cells by Expression of Transcription Factor-Encoding Nucleic Acids 
Islet cells can be produced according to the invention in a variety of ways. In
general, the invention involves stimulating the production of an islet transcription factor.
In an embodiment of particular interest, the invention involves enhancing islet
transcription factor activity by introducing a nucleic acid encoding an islet transcription
factor into a cell, usually a pancreatic cell.

I. Production of Islet Cells In Vitro By Introduction of an Islet Transcription 
Factor-Encoding Nucleic Acid 

Nucleic acid encoding an islet transcription factor (e.g., Ngn3) can be introduced
into a cell in vitro to provide for at least transient expression in the cell. The cells
into which the nucleic acid is introduced can be differentiated
epithelial cells (e.g., pancreatic cells, gut cells, hepatic cells or duct cells), pluripotent
adult or embryonic stem cells, or any mammalian cell capable of developing into β-cells
or cells capable of expression of insulin in vitro following expression of an islet 
transcription factor-encoding nucleic acid. The cell is subsequently implanted into a 
subject having a disorder characterized by a deficiency in insulin, which disorder is 
amenable to treatment by islet cell replacement therapy. In one embodiment, the host 
cell in which Ngn3 expression is provided and which is implanted in the subject is 
derived from the individual who will receive the transplant (e.g., to provide an 
autologous transplant). For example, in a subject having Type 1 diabetes, pluripotent 
stem cells, hepatic cells, gut cells or pancreatic cells can be isolated from the affected 
subject, the cells modified to express Ngn3-encoding DNA, and the cells implanted in 
the affected subject to provide for insulin production, or the transformed cells cultured so 
as to facilitate development of the cells into insulin-producing β-cells. Alternatively,
pluripotent stem cells, hepatic cells, gut cells or pancreatic cells from another subject 
(the "donor") could be modified to express Ngn3-encoding DNA, and the cells 
subsequently implanted in the affected subject to provide for insulin production, or the 
transformed cells cultured so as to facilitate development of the cells into
insulin-producing β-cells.

[124] Introduction of nucleic acid into the cell in vitro can be accomplished according
to methods well known in the art (e.g., through use of electroporation, microinjection,
lipofection, infection with a recombinant (preferably replication-deficient) virus, and
other means well known in the art). The nucleic acid is generally operably linked to a
promoter that facilitates a desired level of polypeptide expression (e.g., a promoter
derived from CMV, SV40, adenovirus, or a tissue-specific or cell type-specific
promoter). Transformed cells containing the recombinant nucleic acid can be selected
and/or enriched via, for example, expression of a selectable marker gene present in the
introduced construct or that is present on a nucleic acid that is co-transfected with the
construct. Typically selectable markers provide for resistance to antibiotics such as
tetracycline, hygromycin, neomycin, and the like. Other markers can include thymidine
kinase and the like. Other markers can include markers that can be used to identify
expressing cells, such as beta-galactosidase or green fluorescent protein.

[125] Expression of the introduced nucleic acid in the transformed cell can be assessed
by various methods known in the art. For example, expression of the introduced gene
can be examined by Northern blot to detect mRNA which hybridizes with a DNA probe
derived from the relevant gene. Those cells that express the desired gene can be further
isolated and expanded in in vitro culture using methods well known in the art. The host
cells selected for transformation will vary with the purpose of the ex vivo therapy (e.g.,
insulin production), the site of implantation of the cells, and other factors that will be
appreciated by the ordinarily skilled artisan.

[126] The transformed cell can also be examined for the development of an islet cell
phenotype. For example, expression of insulin could be detected by PCR, northern blot,
immunocytochemistry, western blot, RIA or ELISA. Alternatively, a marker gene such as
green fluorescent protein or an antibiotic resistance gene operatively linked to an
islet-specific promoter such as the insulin gene promoter could be used for identification
or selection of differentiated islet cells. Methods for engineering a host cell for expression
of a desired gene product(s) and implantation or transplantation of the engineered cells
(e.g., ex vivo therapy) are known in the art (see, e.g., Gilbert et al. 1993 "Cell
transplantation of genetically altered cells on biodegradable polymer scaffolds in
syngeneic rats," Transplantation 56:423-427). For expression of a desired gene in
exogenous or autologous cells and implantation of the cells (e.g., islet cells) into
pancreas, see, e.g., Docherty 1997 "Gene therapy for diabetes mellitus," Clin Sci (Colch)
92:321-330; Hegre et al. 1976 "Transplantation of islet tissue in the rat," Acta
Endocrinol Suppl (Copenh) 205:257-281; Sandler et al. 1997 "Assessment of insulin
secretion in vitro from microencapsulated fetal porcine islet-like cell clusters and rat,
mouse, and human pancreatic islets," Transplantation 63:1712-1718; Calafiore 1997
"Perspectives in pancreatic and islet cell transplantation for the therapy of IDDM,"
Diabetes Care 20:889-896; Kenyon et al. 1996 "Islet cell transplantation: beyond the
paradigms," Diabetes Metab Rev 12:361-372; and Chick et al. 1977 "Artificial pancreas
using living beta cells: effects on glucose homeostasis in diabetic rats," Science
197:780-782. In general, the cells can be implanted into the pancreas, or to any
practical or convenient site, e.g., subcutaneous site, liver, peritoneum.
[127] Methods for transplanting islet cells are well known in the art, see, e.g., Hegre et
al. 1976 "Transplantation of islet tissue in the rat," Acta Endocrinol Suppl (Copenh)
205:257-281; Sandler et al. 1997 "Assessment of insulin secretion in vitro from
microencapsulated fetal porcine islet-like cell clusters and rat, mouse, and human
pancreatic islets," Transplantation 63:1712-1718; Calafiore 1997 "Perspectives in
pancreatic and islet cell transplantation for the therapy of IDDM," Diabetes Care
20:889-896; Kenyon et al. 1996 "Islet cell transplantation: beyond the paradigms,"
Diabetes Metab Rev 12:361-372; and Chick et al. 1977 "Artificial pancreas using living
beta cells: effects on glucose homeostasis in diabetic rats," Science 197:780-782.
[128] In general, after expansion of the transformed cells in vitro, the cells are
implanted into the mammalian subject by methods well known in the art. The number of
cells implanted is a number of cells sufficient to provide for expression of levels of
insulin sufficient to lower blood glucose levels. The number of cells to be transplanted
can be determined based upon such factors as the levels of polypeptide expression
achieved in vitro, and/or the number of cells that survive implantation. The transformed
cells are implanted in an area of dense vascularization such as the liver, and in a manner
that minimizes surgical intervention in the subject. The engraftment of the implant of
transformed cells is monitored by examining the mammalian subject for classic signs of
graft rejection, i.e., inflammation and/or exfoliation at the site of implantation, and fever,
and by monitoring blood glucose levels.
[129] The transplantation method described above is not limited to the expression of
neurogenin3. Engineering a host cell for expression of other islet transcription factors in
the differentiation cascade, such as islet factors and, in particular, NeuroD1/BETA2, may
be beneficial to subjects with insulin deficiencies.

II. In Vivo Development of Islet Cells and Production of Insulin in the Pancreas 
[130] Islet transcription factor-encoding nucleic acid can be delivered directly to a
subject to provide for islet transcription factor expression in a target cell (e.g., a
pancreatic cell, gut cell, liver cell, or other organ cell capable of expressing an islet
transcription factor and providing production of insulin), thereby promoting
development of the cell into an insulin-producing cell (e.g., in pancreas) or curing a
defect in islet transcription factor expression in the subject. Methods for in vivo delivery
of a nucleic acid of interest for expression in a target cell are known in the art. For
example, in vivo methods of gene delivery normally employ either a biological means of
introducing the DNA into the target cells (e.g., a virus containing the DNA of interest) or
a mechanical means to introduce the DNA into the target cells (e.g., direct injection of
DNA into the cells, liposome fusion, or pneumatic injection using a gene gun).

[131] In general, the transformed cells expressing the protein encoded by the DNA of
interest produce a therapeutically effective amount of the protein to produce islet cells, in
particular β-cells, in the mammalian patient. In one embodiment, the DNA of interest
encodes an islet transcription factor such as Neurogenin1, Neurogenin2, Neurogenin3,
NeuroD1/BETA2, Mash1 or NeuroD4/Math3 (with Ngn3 being of particular interest),
and the DNA of interest is operably linked to a promoter, which may be heterologous or
endogenous to the transcription factor.

[132] In general terms, the delivery method comprises introducing the nucleic acid of
interest-containing vector into a pancreatic cell. By way of example, the DNA of
interest-containing vector may comprise either a viral or non-viral vector (including naked
DNA), which is introduced into the pancreas in vivo via the duct system. Intraductal
administration can be accomplished by cannulation by, for example, insertion of the
cannula through a lumen of the gastrointestinal tract, by insertion of the cannula through
an external orifice, or insertion of the cannula through the common bile duct.
Retrograde ductal administration may be accomplished in the pancreas by endoscopic
retrograde cholangio-pancreatography (ERCP). Exemplary methods for accomplishing
intraductal delivery to the pancreas are described in U.S. Pat. No. 6,004,944.

[133] The precise amount of islet transcription factor-encoding nucleic acid
administered will vary greatly according to a number of factors including the
susceptibility of the target cells to transformation, the size and weight of the subject, the
levels of protein expression desired, and the condition to be treated. The amount of
nucleic acid and/or the number of infectious viral particles effective to infect the targeted
tissue, transform a sufficient number of cells, and provide for production of a desired
level of insulin can be readily determined based upon such factors as the efficiency of
the transformation in vitro and the susceptibility of the targeted cells to transformation.
For example, the amount of DNA introduced into the pancreatic duct of a human is
generally from about to about 750 mg, preferably from about 500 µg to
about 500 mg, more preferably from about 10 mg to about 200 mg, most preferably about
100 mg. Generally, the amounts of DNA can be extrapolated from the amounts of DNA
effective for delivery and expression of the desired gene in an animal model. For
example, the amount of DNA for delivery in a human is roughly 100 times the amount
of DNA effective in a rat.
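
The rat-to-human extrapolation described above can be sketched as a simple scaling calculation. The following Python fragment is illustrative only: the function names and the 1 mg rat amount are hypothetical, and the only values taken from the text are the roughly 100-fold scaling factor and the "more preferably" range of about 10 mg to about 200 mg.

```python
# Illustrative sketch only; names and the example rat amount are hypothetical.
# From the text: human amount is roughly 100 times the amount effective in a rat.
RAT_TO_HUMAN_SCALE = 100.0

# "More preferably" range stated in the text for human intraductal DNA, in mg.
MORE_PREFERRED_RANGE_MG = (10.0, 200.0)


def extrapolate_human_dna_mg(rat_dna_mg: float, scale: float = RAT_TO_HUMAN_SCALE) -> float:
    """Scale a DNA amount effective in a rat to a rough human estimate (mg)."""
    if rat_dna_mg <= 0:
        raise ValueError("rat_dna_mg must be positive")
    return rat_dna_mg * scale


def in_range(amount_mg: float, bounds: tuple[float, float]) -> bool:
    """Return True if amount_mg lies within the inclusive bounds."""
    low, high = bounds
    return low <= amount_mg <= high


if __name__ == "__main__":
    hypothetical_rat_amount_mg = 1.0  # assumed value, not from the text
    human_mg = extrapolate_human_dna_mg(hypothetical_rat_amount_mg)
    print(f"Estimated human amount: {human_mg} mg")
    print("Within more-preferred range:", in_range(human_mg, MORE_PREFERRED_RANGE_MG))
```

Such a calculation gives only a starting estimate; as stated above, the effective amount also depends on transformation efficiency and the susceptibility of the targeted cells.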

Pancreatic cells modified according to the invention can facilitate sufficiently 
high levels of expression of a nucleic acid of interest, particularly where the nucleic acid 
delivered is DNA and the DNA of interest is operably linked to a strong eukaryotic 
promoter (e.g., CMV, MMTV). The expressed protein can induce islet cell and insulin 
production. Thus the methods of the invention are useful in treating a mammalian 
subject having a variety of insulin related conditions. 

In the preferred embodiment, the encoded proteins are islet transcription factors 
from the class of basic helix-loop-helix (bHLH) proteins. For example, the expression of
neurogenin3 and/or NeuroD1/BETA2 may substantially induce the production of islet
cells and insulin in mammals.

The actual number of transformed pancreatic cells required to achieve therapeutic 
levels of the protein of interest will vary according to several factors including the 
protein to be expressed, the level of expression of the protein by the transformed cells, 
the rate at which the protein induces islet cell production (in particular β-cells), and
the condition to be treated. 

Regardless of whether the islet transcription factor-encoding nucleic acid is 
introduced in vivo or ex vivo, the nucleic acid (or islet cells produced in vitro or 
recombinant cells expressing the islet transcription factor nucleic acid that are to be 
transplanted for development into islet cells in vivo post-transplantation) can be 
administered in combination with other genes and other agents. 
Assessment of Therapy
[138] The effects of ex vivo or in vivo therapy according to the methods of the
invention can be monitored in a variety of ways. Generally, a sample of blood from the
subject can be assayed for, for example, levels of glucose, proinsulin, c-peptide, and
insulin. Appropriate assays for detecting proinsulin, c-peptide, insulin and glucose in
blood samples are well known in the art. Evidence for recurrent autoimmunity can be
gauged by assaying for autoreactive T cells or for antibodies against islet proteins such
as glutamic acid decarboxylase (GAD), or other autoantigens well known in the art.
EXAMPLES

[139] The following examples are put forth so as to provide those of ordinary skill in
the art with a complete disclosure and description of how to carry out the invention and
are not intended to limit the scope of what the inventors regard as their invention. Efforts
have been made to ensure accuracy with respect to numbers used (e.g., amounts,
temperatures, etc.), but some experimental error and deviation should be accounted for.
Unless indicated otherwise, parts are parts by weight, molecular weight is weight
average molecular weight, temperature is in degrees Centigrade, and pressure is at or
near atmospheric.

Example 1: Detection of Ngn3 Expression in Murine Pancreas 
[140] Members of the basic helix-loop-helix (bHLH) family of transcription factors
regulate growth and differentiation of numerous cell types. Insulin gene expression is
activated by a heterodimeric complex of two bHLH proteins: a ubiquitously expressed
(class A) protein and a cell-type-specific (class B) partner, NeuroD1/BETA2.
NeuroD1/BETA2 is also important for β-cell development. The targeted disruption of
the NeuroD1/BETA2 gene in mice leads to a marked reduction of the β-cell mass at birth
due to increased apoptosis of islet cells late in fetal development. There is no apparent
defect, however, in β-cell formation or insulin gene expression, despite the postulated
importance of this factor in β-cell differentiation.






[141] Assuming that this modest phenotype reflected the redundant expression of
closely related class B bHLH proteins in the endocrine pancreas, the inventors searched
for additional members of the family by reverse transcriptase-polymerase chain reaction
(RT-PCR) using degenerate oligonucleotide primers based on conserved amino acid
sequences in the bHLH domain of the class B bHLH proteins (Sommer et al. 1996 Mol
Cell Neurosci. 8:221). PCR analysis revealed that pancreatic endocrine cell lines and
isolated adult islets express not only NeuroD1 but also several other members of the
family of neural class B bHLH genes, including Mash1, neuroD2 and 4, and
neurogenins (ngn) 1, 2 and 3. This remarkable degree of redundancy could compensate
for the loss of NeuroD1/BETA2 in mice. The two most commonly amplified sequences
encoded NeuroD4 and Ngn3, but in situ hybridization studies in mouse pancreas showed
highest expression of NeuroD1 and Ngn3. These results were confirmed by
immunohistochemistry.

[142] Ngn3 is detected earliest at embryonic day 11.5 (e11.5) in the mouse, increases to
a maximum at e15.5 and decreases at e18.5, with no staining seen in the adult pancreas.
Ngn3 is detected in the nuclei of scattered ductal cells and periductal cells, and there was
no co-staining with any of the four islet hormones (insulin, glucagon, somatostatin and
pancreatic polypeptide). This temporal and spatial pattern of expression implicated
Ngn3 as a marker for islet cell precursors. Nkx6.1, a specific marker for future
beta-cells, was expressed in 10-20% of the Ngn3 positive cells, further supporting the
use of Ngn3 as a marker for islet cell precursors. The peak of Ngn3 expression at e15.5
also corresponds with the peak of new beta-cell formation in the fetus. Our data
support a model in which Ngn3 acts upstream of NeuroD1/BETA2 and other islet
differentiation factors, marking islet cell precursors, but switching off prior to final
differentiation.

Example 2: Isolation and Sequencing of a Human Ngn3 Polypeptide-Encoding 
Polynucleotide 

[143] A probe derived from a cloned fragment of the murine Ngn3 gene (Sommer et
al., supra) was used to screen a human genomic library. This screen resulted in the
isolation of the genomic sequence provided as SEQ ID NO:1 in the sequence listing.
Based on mapping of the murine start site using 5' RACE of mouse fetal pancreatic
RNA, the transcriptional start site in the human Ngn3-encoding sequence is at nucleotide
residue 2643. The coding sequence is between nucleotide residues 3022-3663, with a
stop site at 3664-3666. No introns are within the 5' untranslated region (UTR) or the
coding sequence of SEQ ID NO:1.

The promoter of Ngn3 is of interest, particularly given that it is exceptionally
well-conserved between mouse, rat, and human. Given the role of Ngn3 in pancreatic
and islet cell development, the Ngn3 promoter is likely key to determining the number of
islet cells in the mature pancreas. The regulatory region corresponding to the human
Ngn3 promoter comprises sequences up to approximately 500 bp upstream of the
transcription start site within the human Ngn3 promoter (e.g., from about 2144 to the
transcriptional start site at 2643). FISH was used to identify the location of Ngn3 on the
human chromosome at 10q22.1-22.2.
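
The coordinates given in this example for SEQ ID NO:1 can be checked with simple interval arithmetic. The following Python sketch is illustrative only; the helper names are hypothetical, the sequence itself is not reproduced here, and it is assumed that all residue numbers are 1-based and inclusive.

```python
# Illustrative sketch; helper names are hypothetical. Coordinates are the
# residue numbers stated in the text for SEQ ID NO:1 (assumed 1-based, inclusive).
PROMOTER_START = 2144        # approximate upstream boundary of the promoter region
TRANSCRIPTION_START = 2643   # transcriptional start site
CDS_START, CDS_END = 3022, 3663
STOP_START, STOP_END = 3664, 3666


def span_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive interval."""
    return end - start + 1


def extract_region(sequence: str, start: int, end: int) -> str:
    """Slice a 1-based inclusive interval out of a sequence string."""
    return sequence[start - 1:end]


# 5' UTR runs from the transcription start site to the residue before the CDS.
utr5_length = span_length(TRANSCRIPTION_START, CDS_START - 1)
cds_length = span_length(CDS_START, CDS_END)
codon_count = cds_length // 3
promoter_length = span_length(PROMOTER_START, TRANSCRIPTION_START)

if __name__ == "__main__":
    print("5' UTR length (nt):", utr5_length)
    print("Coding sequence length (nt):", cds_length)
    print("Codons before the stop codon:", codon_count)
    print("Promoter region length (nt):", promoter_length)
```

Under these assumptions the coding interval is a whole number of codons and the stop site immediately follows it, which is consistent with the absence of introns in the coding sequence.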

Example 3: Isolation and Sequencing of a Murine Ngn3 Polypeptide-Encoding 
Polynucleotide and Promoter 

The full-length murine Ngn3 sequence and its 5' flanking sequences, which 
included the murine Ngn3 promoter, were obtained by sequencing a previously obtained 
mouse genomic DNA fragment (Sommer, et al., supra). The murine Ngn3 sequence is 
provided in the Sequence Listing as SEQ ID NO:3, with the encoded polypeptide 
provided as SEQ ID NO:4. The transcriptional start site was determined using the 5' 
RACE method and confirmed using RNase protection with RNA from fetal mouse
pancreas, and is at nucleotide residue 719; the coding sequence for murine Ngn3 begins 
at nucleotide residue 1093. The promoter comprises a region approximately 500 bp 
upstream of the transcription start site. 

Example 4: Construction Of Adenovirus Vector Encoding Neurogenin3. 

The full length mouse neurogenin3 coding sequence was inserted downstream of
the cytomegalovirus immediate early gene promoter (PCMV IE) in the Adeno-X viral
genome, and intact viral particles were produced as per the instructions of the
manufacturer (Clontech, Palo Alto, CA) for the Adeno-X™ Expression System. Fig. 1
provides a map of the final Adeno-X.NGN3 construct. A control construct was
produced using a lacZ coding sequence in lieu of Ngn3 to produce an Adeno.LacZ
construct. Large scale preparation of adenovirus for the experiments below was
performed using protocols well known in the art. Virus concentration was estimated
based on the protein concentration of the purified virus stock, not by plaque assay.
Identity of the virus was confirmed by PCR during preparation, and by Western blot for
neurogenin3 using lysates from cells infected with the purified stock.

Example 5: Induction of insulin in normal adult rats by treatment with Adeno-X.NGN3. 
[147] Adult male Sprague-Dawley rats weighing 250-350 g were injected with either
Adeno.LacZ or Adeno-X.NGN3 into the pancreatic duct using the previously described
surgical technique (Goldfine et al. Nat Biotechnol 15:1378-82, 1997). Approximately
3 x 10^10 (low dose) or 3 x 10^11 (high dose) viral particles were injected into the
pancreatic duct of each animal. After recovery from the surgery, the animals were
returned to a normal diet. After approximately 48 hours, the animals subjected to the
low dosage of Adeno-X.NGN3 were sacrificed, the pancreases were removed, fixed,
embedded in paraffin and sectioned. The pancreatic sections were stained for insulin,
glucagon, cytokeratin 20 (a marker for ducts), and mouse neurogenin3 using established
immunohistochemical techniques and antisera (Schwitzgebel et al. Development
127:3533-3542, 2000).

[148] Animals injected with the low or high dose of Adeno-X.NGN3 had individual or
small clumps of 8 or fewer insulin or glucagon staining cells found scattered near the
ducts, in addition to the normal large (approximately 1000 cells) aggregations of
glucagon and insulin staining cells that form the islets of Langerhans. Fig. 2A shows the
results of histological staining for insulin in a pancreatic tissue section of an animal
injected with low dose (3 x 10^10) Adeno-X.NGN3. Individual and small clumps of
insulin staining cells are indicated with arrows. Fig. 2B shows a serial section from
the same animal stained for the duct cell marker cytokeratin 20, with the arrows
indicating the position of the insulin staining in Fig. 2A.

[149] Animals injected with the control Adeno.LacZ virus had no scattered insulin or
glucagon staining cells outside of the islets of Langerhans. All of the animals injected
with high titer of the neurogenin3-expressing adenovirus died within 36 hours after
injection. One of the high dose animals before death had a blood glucose level of 32
mg/dl measured by the glucose oxidase method using the Glucometer Elite meter
(Bayer Corporation, Elkhart, IN). The blood glucose level for a fasted rat is normally
approximately 100 mg/dl and is rarely seen below 70 mg/dl. The blood glucose level of 32
mg/dl for the high dose animal indicates that excess insulin was being produced to
lower the blood glucose to hypoglycemic levels. The animals injected with the high
dose of DNA encoding Ngn3 appeared to have died from hypoglycemia induced by the
overexpression of insulin from the newly produced β-cells.
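
The comparison of a measured blood glucose level against the typical fasted rat values given above can be sketched as follows. This Python fragment is illustrative only: the function names and category labels are hypothetical, readings are assumed to be in mg/dl, and the conversion to mmol/l uses the molar mass of glucose (180.16 g/mol), which is not stated in the text.

```python
# Illustrative sketch; function names and labels are hypothetical.
# From the text: a fasted rat is normally about 100 mg/dl and rarely below 70 mg/dl.
TYPICAL_FASTED_MG_DL = 100.0
LOW_FLOOR_MG_DL = 70.0

# Molar mass of glucose in g/mol (equivalently mg/mmol); an outside fact.
GLUCOSE_MG_PER_MMOL = 180.16


def mg_dl_to_mmol_l(mg_dl: float) -> float:
    """Convert a glucose concentration from mg/dl to mmol/l."""
    return mg_dl * 10.0 / GLUCOSE_MG_PER_MMOL


def classify_fasted_rat_glucose(mg_dl: float, floor: float = LOW_FLOOR_MG_DL) -> str:
    """Label a reading relative to the lower bound rarely seen in fasted rats."""
    return "below typical fasted range" if mg_dl < floor else "at or above typical floor"


if __name__ == "__main__":
    for reading in (32.0, LOW_FLOOR_MG_DL, TYPICAL_FASTED_MG_DL):
        print(
            f"{reading:.0f} mg/dl = {mg_dl_to_mmol_l(reading):.2f} mmol/l: "
            f"{classify_fasted_rat_glucose(reading)}"
        )
```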

[150] Fig. 3 shows a confocal image with fluorescent co-staining for insulin in red and
neurogenin3 in green in the pancreas of an animal infected with high dose (3 x 10^11)
Adeno-X.NGN3 and harvested at approximately 24 hours. It can be seen that the cell in
the center of the figure stains for insulin in the cytoplasm and neurogenin3 in the
nucleus. No neurogenin3 could be detected in the pancreases of the control animals.

Example 6: Normalization of Blood Glucose Levels In Diabetic Induced Adult Rats 
Induced By The Introduction Of DNA Encoding Murine Ngn3 Into The Pancreas 
[151] Diabetes was induced in adult male Sprague-Dawley rats weighing 250-350 g by
injection of streptozotocin (Sigma: 40 mg/kg of body weight, in
1 mM citrate buffer, pH 4.5) into the peritoneal space on day 1 and day 2. Streptozotocin
is a β-cell toxin which induces diabetes mellitus in rats. On day 4, animals that were
confirmed to have hyperglycemia (blood glucose greater than 300 mg/dl measured by
the glucose oxidase method using the Glucometer Elite meter (Bayer Corporation,
Elkhart, IN)) were injected with either Adeno.LacZ or Adeno-X.NGN3 into the
pancreatic duct using the previously described surgical technique. Approximately
3 x 10^10 or 3 x 10^11 viral particles were injected into the pancreatic duct of each animal.
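
The weight-based streptozotocin dosing and the hyperglycemia inclusion criterion described in this paragraph can be sketched as a short calculation. This Python fragment is illustrative only; the function names are hypothetical, and the only values taken from the text are the 40 mg/kg dose, the 250-350 g weight range, the two injection days, and the 300 mg/dl threshold.

```python
# Illustrative sketch; function names are hypothetical.
STZ_DOSE_MG_PER_KG = 40.0       # from the text
INJECTION_DAYS = 2              # day 1 and day 2, from the text
HYPERGLYCEMIA_MG_DL = 300.0     # inclusion threshold, from the text


def stz_dose_mg(body_weight_g: float, dose_mg_per_kg: float = STZ_DOSE_MG_PER_KG) -> float:
    """Streptozotocin amount per injection (mg) for a rat of the given weight."""
    if body_weight_g <= 0:
        raise ValueError("body_weight_g must be positive")
    return dose_mg_per_kg * body_weight_g / 1000.0


def is_confirmed_hyperglycemic(glucose_mg_dl: float) -> bool:
    """True if blood glucose exceeds the threshold used to confirm diabetes."""
    return glucose_mg_dl > HYPERGLYCEMIA_MG_DL


if __name__ == "__main__":
    for weight_g in (250.0, 350.0):
        per_injection = stz_dose_mg(weight_g)
        print(
            f"{weight_g:.0f} g rat: {per_injection:.1f} mg per injection, "
            f"{per_injection * INJECTION_DAYS:.1f} mg over both days"
        )
```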



[152] Five days after surgery, the animals were sacrificed, the pancreases were
removed, fixed, embedded in paraffin and sectioned. The pancreatic sections were
stained for insulin, glucagon, cytokeratin 20 (a marker for ducts), and mouse
neurogenin3. Fig. 4A shows the results of histological staining for insulin in a
pancreatic tissue section of an animal injected with Adeno-X.NGN3, which indicates the
production of insulin in accordance with the invention. Individual and small clumps of
insulin staining cells are indicated with red arrows. An islet with a few residual insulin
staining cells (the remainder having been destroyed by streptozotocin) is indicated with a
black arrow.

[153] Fig. 4B shows the results of staining for insulin in the pancreas of an animal
injected with Adeno.LacZ virus. Two islets with a few residual insulin staining cells are
indicated with black arrows. No individual or small clumps of insulin staining cells
were detected in the control animals.

[154] The animals injected with the Adeno-X.NGN3 virus had significantly lower
blood glucose levels than the control animals injected with Adeno.LacZ. As shown in
Fig. 5, blood glucose levels were significantly decreased in the diabetic rats that received
Adeno-X.NGN3 relative to the diabetic rats that received Adeno.LacZ. The decreased
blood glucose levels were observed on both day 2 and day 4. Thus, these data show that
introduction of Ngn3-encoding DNA into the pancreas results in persistent expression of
insulin, and that the insulin expressed by the transformed pancreatic cells is secreted into
the bloodstream and can function in regulation of blood glucose at levels sufficient to
overcome diabetes in an animal model.
[155] The present invention demonstrates that cells in the mature pancreas, most likely
duct cells, retain the capacity to differentiate into endocrine cells. Adult duct cells
respond to pro-endocrine bHLH genes, and in particular, Ngn3, and in combination with
other signals these genes can induce new beta cell formation and subsequent insulin
production. The present invention will be useful in the maintenance and treatment of
type 2 diabetes and also allow for the replacement of beta cells lost to autoimmune
destruction in individuals with type 1 diabetes.

Example 7: Overexpression Of Ngn3 And Islet Cell Production In Ngn3 Transgenic Animals
[156] As previously described (Schwitzgebel et al. Development 127:3533-3542,
2000), the pdx1 promoter vector pBAT.PD17 was constructed by inserting the mouse
pdx1 promoter (a 4.4 kb XbaI-SmaI fragment from the mouse Pdx1 gene containing the
transcription start site and promoter (Apelqvist et al. Curr Biol 7:801-804; Wu et al. Mol
Cell Biol 17:6002-6013 (1997))) and the human beta-globin gene first intron upstream of
the pBAT polylinker (German et al. Genes & Dev 6:2165-2176), and the SV40 late gene
polyadenylation signal. A 663 bp DNA fragment encoding full length mouse ngn3
cDNA was obtained by PCR from the mouse genomic neurogenin3 clone (Sommer et al.
Mol Cell Neurosci 8:221-241 (1996)) and inserted into the pBAT.PD17 polylinker.
[157] The vectors were linearized and purified, and transgenic mice were generated by
pronuclear injection (1.5 ng/µl) into F1 hybrid oocytes from C3Fe/B6 parents (Hogan et
al. (1994) Manipulating the Mouse Embryo: A Laboratory Manual. New York: Cold
Spring Harbor Laboratory Press). Genotypes were determined by PCR analysis of
genomic DNA from tail biopsies. The primers used were: 5'
TGGAGAACTGTCAAAGCGATCTG (SEQ ID NO:5) (Pdx1 primer for 5') and 5'
CACATGCCCAGTTTCTATTGGTC (SEQ ID NO:6) (human beta-globin intron for
3').
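
Basic properties of the two genotyping primers listed above can be computed directly from their sequences. The following Python sketch is illustrative only; the helper names are hypothetical, and it reports just length, GC content, and the reverse complement, without modeling annealing temperature or amplicon size.

```python
# Illustrative sketch; helper names are hypothetical. Primer sequences are
# those given in the text as SEQ ID NO:5 and SEQ ID NO:6.
PRIMERS = {
    "SEQ ID NO:5 (Pdx1, 5' primer)": "TGGAGAACTGTCAAAGCGATCTG",
    "SEQ ID NO:6 (human beta-globin intron, 3' primer)": "CACATGCCCAGTTTCTATTGGTC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name)
        print(f"  length: {len(seq)} nt, GC: {gc_fraction(seq):.1%}")
        print(f"  reverse complement: {reverse_complement(seq)}")
```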

[158] Embryos were harvested at embryonic day 12.5 (E12.5) or E18.5. A total of 6
pdx1-ngn3 animals were examined at E12.5, and 10 at E18.5. The transgenic embryos
were not grossly abnormal in size. Transgenic embryos harvested at E12.5 have an
increase in the numbers of islet cells as indicated in Fig. 6B by staining for glucagon.
The control, non-transgenic embryos have normal numbers of glucagon expressing cells
(Fig. 6A). At E18.5, the pancreas size is grossly reduced in the transgenic animals, but
the fraction of islet cells is markedly increased relative to the same age control animals
(Fig. 7A).

[159] These experiments demonstrate that neurogenin3 is capable of inducing islet cell
neogenesis in appropriate progenitor cells.

Example 8. Islet cell production in NeuroD1 transgenic animals
[160] The neuroD1/BETA2 vector was constructed by cloning into pBAT.PD17 a 1.7
kb DNA fragment encoding the full length mouse cDNA (Lee et al. (1995) Science
268:836-844) extending from the start codon through the 3' UTR.
[161] The vectors were linearized and purified, and transgenic mice were generated by
pronuclear injection (1.5 ng/µl) into F1 hybrid oocytes from C3Fe/B6 parents as
described (Hogan et al. (1994), supra). Genotypes were determined by PCR analysis of
genomic DNA from tail biopsies. The primers used were: 5'
TGGAGAACTGTCAAAGCGATCTG (SEQ ID NO:5) (Pdx1 primer for 5') and 5'
CACATGCCCAGTTTCTATTGGTC (SEQ ID NO:6) (human beta-globin intron for
3').

[162] Embryos were harvested at embryonic day 12.5 (E12.5) or E18.5. A total of 5
pdx1-neuroD1 animals were examined at E12.5, and 10 at E18.5. Similar to the
neurogenin3 transgenic animals, the pdx1-neuroD1 transgenic embryos harvested at
E12.5 have an increase in the numbers of islet cells as indicated in Fig. 6C by staining
for glucagon. At E18.5, the pancreas size is grossly reduced in the transgenic animals,
but the fraction of islet cells is markedly increased relative to the same age control
animals (Fig. 7B).

[163] These experiments demonstrate that, like neurogenin3, neuroD1 is capable of
inducing islet cell neogenesis in appropriate progenitor cells.

Example 9. Construction of Adenovirus Vector Encoding neuroD1
[164] The full-length mouse or human neuroD1 coding sequence is inserted
downstream of the cytomegalovirus immediate early gene promoter (PCMV IE) in the
adenoviral genome, and intact viral particles are produced according to methods well known
in the art, for example as per the instructions of the manufacturer (Clontech, Palo Alto,
CA) for the Adeno-X™ Expression System. Fig. 8 provides a map of such an
exemplary Adeno-NeuroD1 construct.
Example 10. Construction of Adenovirus Vector Encoding mash1/ASCL1/ASH1
[165] The full length mouse mash1 or human ASCL1/ASH1 (human symbols for
mash1) coding sequence is inserted downstream of the CMV immediate early gene
promoter (PCMV IE) in the adenoviral genome, and intact viral particles are produced
according to methods well known in the art, for example as per the instructions of the
manufacturer (Clontech, Palo Alto, CA) for the Adeno-X™ Expression System. Fig. 8
is a map of such an exemplary Adeno-mash1/ASCL1/ASH1 construct.

Example 11: Induction of the formation of insulin-producing beta-cells in normal adult
rats by treatment with Adeno-NeuroD1
[166] Adult male Sprague-Dawley rats weighing 250-350 g are injected with either
Adeno-LacZ or Adeno-neuroD1 (mouse or human) into the pancreatic duct using
previously described surgical techniques (see, e.g., Goldfine et al. Nat. Biotechnol.
15:1378-82 (1997)). Approximately 3×10^10 viral particles (low dose) or 3×10^11 (high
dose) are injected into the pancreatic duct of each animal. After recovery from the
surgery, the animals are returned to a normal diet. After approximately 48 hours, the
animals are sacrificed, and the pancreases are removed, fixed, embedded in paraffin, and
sectioned. The pancreatic sections are stained for insulin, glucagon, cytokeratin 20 (a
marker for ducts), and neuroD1 using established immunohistochemical techniques and
antisera (Schwitzgebel et al. Development 127:3533-3542, 2000).

Example 12: Induction of the formation of insulin-producing beta-cells in normal adult
rats by treatment with Adeno-mash1/ASCL1/ASH1
[167] Adult male Sprague-Dawley rats weighing 250-350 g are injected with either
Adeno-LacZ or Adeno-mash1/ASCL1/ASH1 (mouse or human) into the pancreatic duct
using previously described surgical techniques (see, e.g., Goldfine et al. Nat. Biotechnol.
15:1378-82 (1997)). Approximately 3×10^10 viral particles (low dose) or 3×10^11 (high
dose) are injected into the pancreatic duct of each animal. After recovery from the
surgery, the animals are returned to a normal diet. After approximately 48 hours, the
animals are sacrificed, and the pancreases are removed, fixed, embedded in paraffin, and
sectioned. The pancreatic sections are stained for insulin, glucagon, cytokeratin 20 (a
marker for ducts), and neuroD1 using established immunohistochemical techniques and
antisera (Schwitzgebel et al. Development 127:3533-3542, 2000).



Example 13: Production of insulin in diabetic induced adult rats by the introduction of
DNA encoding NeuroD1 into the pancreas

[168] Diabetes is induced in adult male Sprague-Dawley rats weighing 250-350 g by
injection with the beta-cell toxin streptozotocin (Sigma: 40 mg/kg of body weight, in
1 mM citrate buffer, pH 4.5) into the peritoneal space on day 1 and day 2. Streptozotocin
is a beta-cell toxin which induces diabetes mellitus in rats. On day 4, animals confirmed
to have hyperglycemia (blood glucose greater than 300 mg/dl measured by the glucose
oxidase method using the Glucometer Elite meter (Bayer Corporation, Elkhart, IN)) are
injected with either Adeno-LacZ or Adeno-NeuroD1 (mouse or human) into the
pancreatic duct using the previously described surgical technique. Approximately
3×10^10 or 3×10^11 viral particles are injected into the pancreatic duct of each animal.
Animals are then monitored every 12 hours for a fall in blood glucose levels.
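
The weight-based dosing and the inclusion criterion in this protocol reduce to simple arithmetic, sketched below in Python. The 40 mg/kg dose and the 300 mg/dl threshold are taken from the text; the function names `stz_dose_mg` and `is_hyperglycemic` are hypothetical helpers introduced only for illustration.

```python
STZ_DOSE_MG_PER_KG = 40.0          # streptozotocin dose stated in the text
HYPERGLYCEMIA_MG_PER_DL = 300.0    # inclusion threshold stated in the text


def stz_dose_mg(body_weight_g: float,
                dose_mg_per_kg: float = STZ_DOSE_MG_PER_KG) -> float:
    """Streptozotocin mass (mg) for one injection, given body weight in grams."""
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    return dose_mg_per_kg * body_weight_g / 1000.0


def is_hyperglycemic(glucose_mg_dl: float,
                     threshold_mg_dl: float = HYPERGLYCEMIA_MG_PER_DL) -> bool:
    """True when blood glucose is strictly greater than the threshold."""
    return glucose_mg_dl > threshold_mg_dl


if __name__ == "__main__":
    # Range of per-injection doses across the stated 250-350 g weight range.
    for weight_g in (250, 300, 350):
        print(f"{weight_g} g rat: {stz_dose_mg(weight_g):.1f} mg per injection")
```

Under these figures, each of the two injections would deliver roughly 10 to 14 mg of streptozotocin per animal across the stated weight range.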


Example 14: Production of insulin in diabetic induced adult rats by the introduction of
DNA encoding mash1/ASCL1/ASH1 into the pancreas

[169] Diabetes is induced in adult male Sprague-Dawley rats weighing 250-350 g by
injection with the beta-cell toxin streptozotocin (Sigma: 40 mg/kg of body weight, in
1 mM citrate buffer, pH 4.5) into the peritoneal space on day 1 and day 2. Streptozotocin
is a beta-cell toxin which induces diabetes mellitus in rats. On day 4, animals confirmed
to have hyperglycemia (blood glucose greater than 300 mg/dl measured by the glucose
oxidase method using the Glucometer Elite meter (Bayer Corporation, Elkhart, IN)) are
injected with either Adeno-LacZ or Adeno-mash1/ASCL1/ASH1 (mouse or human)
into the pancreatic duct using the previously described surgical technique.
Approximately 3×10^10 or 3×10^11 viral particles are injected into the pancreatic duct of
each animal. Animals are then monitored every 12 hours for a fall in blood
glucose levels.
Example 15: Construction of Plasmid Vector Encoding Neurogenin3
[170] The full length mouse or human neurogenin3 coding sequence, either cDNA or
gene, is inserted downstream of the CMV immediate early gene promoter (PCMV IE)
and the human beta-globin gene intron, and upstream of the SV40 polyadenylation
signal in the pBAT16 plasmid vector, which also contains the colE1 origin of replication
for high copy number replication in bacteria, and the bacterial ampicillin resistance gene
for selection of plasmid-containing bacteria. Purified plasmid DNA is produced
according to methods well known in the art. Fig. 9 provides a map of such an exemplary
pBAT16.hNGN3 construct.

Example 16: Production of insulin in diabetic induced adult rats by the introduction of
plasmid DNA vector pBAT16.hNGN3 into the pancreas
[171] Diabetes is induced in adult male Sprague-Dawley rats weighing 250-350 g by
injection with the beta-cell toxin streptozotocin (Sigma: 40 mg/kg of body weight, in
1 mM citrate buffer, pH 4.5) into the peritoneal space on day 1 and day 2. Streptozotocin
is a beta-cell toxin which induces diabetes mellitus in rats. On day 4, animals confirmed
to have hyperglycemia (blood glucose greater than 300 mg/dl measured by the glucose
oxidase method using the Glucometer Elite meter (Bayer Corporation, Elkhart, IN)) are
injected with either Adeno-LacZ or Adeno-mash1/ASCL1/ASH1 (mouse or human)
into the pancreatic duct. The ability of naked DNA, with or without adjuvants such as
cationic lipids, to express genes in pancreatic cells when injected into the pancreatic duct
has been described previously (Goldfine et al. 1997, supra). In a similar manner, 8-25
µg of the plasmid DNA is injected into a lumen of the pancreatic duct with or without
adjuvant such as the cationic lipid reagent Transfast (Promega). Animals are then
monitored every 12 hours for a fall in blood glucose levels.
Example 17: Production of insulin in mice with autoimmune diabetes by the
introduction of AdenoX-NGN3 or plasmid DNA vector pBAT16.hNGN3 into the
pancreas

[172] The NOD mouse develops a form of autoimmune diabetes that is an accepted
model for human autoimmune, or "type 1" diabetes mellitus (for a review, see, e.g., Bach
et al. The NOD mouse. Research in Immunology, 1997 Jun, 148(5):285-6). These
animals have been used as a model for testing in an autoimmune setting treatment for
diabetes, such as islet transplantation (for reviews see, e.g., Hahn et al. Adv. Exp. Med.
Biol. 1997 426:411-9; Sutherland Transplantation Proc. 1996, 28(4):2131-3).
[173] In this example, adult mice greater than 15 weeks of age are tested for diabetes
(blood glucose greater than 300 mg/dl measured by the glucose oxidase method using
the Glucometer Elite meter (Bayer Corporation, Elkhart, IN)), and diabetic animals are
injected with either Adeno-LacZ or AdenoX-NGN3 (mouse or human) into the pancreatic
duct using the previously described surgical technique. Approximately 3×10^9 or 3×10^10
viral particles, or 2-25 µg of plasmid DNA, are injected into the pancreatic duct of each
animal. Animals are monitored every 12 hours for a fall in blood glucose levels.
[174] To prevent recurrence of autoimmunity and destruction of the newly formed
beta-cells, several methods are well known in the art, including, but not limited to, the
use of drugs that suppress the immune system, such as Cytoxan or FK506, or reagents
that block co-stimulatory molecules such as antibodies to CTLA-4 (Shapiro et al. New
Engl. J. Med. 2000 343(4):230-8; Griffin et al. J. Immunol. 2000 164(9):4433-42).
Animals are treated with the anti-autoimmune therapy starting 2 days prior to DNA
injection, and maintained on the immunotherapy throughout the experiment.

Example 18: Induction of the formation of islet cells in vitro
[175] In this example, neurogenin3 is used to induce islet cell formation from non-islet
cells cultured in vitro. The cells used can include, but would not necessarily be limited
to, immortalised mammalian cell lines, or primary cultured mammalian cells including
cells from the gastrointestinal organs such as pancreatic duct cells, pancreatic acinar
cells, gut cells including crypt cells, liver cells, and salivary gland cells; adult stem cells
such as hematopoietic stem cells, neural stem cells, muscle stem cells or pancreatic stem
cells; or embryonic stem cells. Cells are cultured using methods well known in the art.
Neurogenin3 is introduced by viral vector (such as with the AdenoX-NGN3 vector) or
naked DNA using a DNA plasmid vector such as pBAT16.hNGN3 along with adjuvant
such as the cationic lipid transfection reagent Transfast. Cells are continued in culture
for a period ranging from one day to several weeks. Detection of newly formed islet
cells is performed by measuring islet hormones such as insulin or glucagon with
radioimmunoassays or ELISA in an extract made from a sample of the cells, or by
measuring hormone levels in the culture media using methods well known in the art.

Example 19: Delivery of Ngn3-Encoding Nucleic Acid to a Human Subject
[176] In this example, patients with diabetes mellitus or a relative deficiency of insulin
are treated with neurogenin3-encoding DNA with the purpose of inducing new islet cell
formation. The neurogenin3-encoding DNA can be contained in a viral vector, as in the
AdenoX-NGN3 example above, or in a naked DNA vector, as in the pBAT16.NGN3
example above, with or without an adjuvant such as the cationic lipid Transfast. The
vector is introduced into the pancreas retrograde through the pancreatic duct. In the
example where the vector is an adenovirus, the amount injected is determined by the
amount needed to lower the blood glucose and maintain it in a normal range, and may be
from about 10^11 to about 10^14 viral particles. Where the vector is a naked DNA
vector, the amount required may be from about 100 µg to 100 mg of DNA.
[177] To prevent recurrence of autoimmunity and destruction of the newly formed
beta-cells, several methods are well known in the art, and have been used clinically in
humans, including, but not limited to, the use of drugs that suppress the immune system,
such as Cytoxan, FK506, or sirolimus; or reagents that block co-stimulatory molecules
such as antibodies to CTLA-4 (Shapiro et al. New Engl. J. Med. 2000 343(4):230-8;
Griffin et al. J. Immunol. 2000 164(9):4433-42). Patients are treated with the
anti-autoimmune therapy starting 1-2 days prior to DNA administration, and maintained on
the immunotherapy afterwards.



After the procedure, the blood glucose is monitored closely, as often as hourly, or 
by use of a continuous glucose monitor, and insulin therapy by subcutaneous injection 
decreased or stopped as needed. If blood glucose does not completely normalize without 
exogenous insulin, the procedure is repeated. To monitor for recurrent autoimmune 
destruction of the new beta-cells, the blood glucose is monitored periodically. In
addition, direct evidence of an immune response to new cells can be tested by assaying
for autoreactive T cells, antibodies to glutamic acid decarboxylase (anti-GAD 
antibodies) or to islet antigens (islet cell antibodies or ICA). Evidence of recurrent 
autoimmunity can be treated with an increase or other change in the immunosuppression 
therapy. 

Example 20: Characterization of and Regulation of the Neurogenin3 Promoter 

The following experiments provide for characterization of the Ngn3 promoter, as 

well as identification of factors both upstream and downstream of Ngn3 that participate 

in the regulatory pathway that controls Ngn3 expression. 
Materials and Methods 

The following materials and methods were used in the present example. 

Cloning of the mouse and human ngn3 gene promoter. A lambda phage genomic
clone containing the mouse neurogenin 3 open reading frame, clone 17/6-1-1-2, was used
(Sommer et al. Mol. Cell. Neurosci. 8:221-241, 1996). From this phage clone, a 1 kb fragment
containing sequences upstream of the open reading frame was subcloned and sequenced.
Human neurogenin 3 genomic clones were obtained by screening a lambda DASH
human genomic library with the mouse neurogenin 3 genomic fragment. The clone
containing the longest 5' flanking sequence, clone 14H, was subcloned, sequenced, and
used for generating reporter gene plasmids.

5' Rapid amplification of cDNA ends (RACE). The 5' end of the mouse ngn3
cDNA was identified by 5'-RACE, using a modification of the protocol from the 5'-
RACE System Version 2.0 (GibcoBRL). For mouse cDNA, 2.5 pmol of specific primer
JL1 (5'-ATCCTGCGGTTGGGAA-3' (SEQ ID NO:7)) was annealed to 1 µg of total
RNA from mouse E15.5 pancreas. Reverse transcription was carried out using
Superscript II reverse transcriptase (GibcoBRL). After first strand cDNA synthesis, the
original mRNA template was removed by treatment with RNase, and homopolymeric
dCTP tails were then added to the 3'-end of the cDNA using terminal deoxynucleotidyl
transferase. Using this product as a template, we carried out 35 cycles of PCR using the
5' RACE Abridged Anchor Primer (GibcoBRL) and JL2 (5'-
TGGAAGGTGTGTGTGTGCCAG-3' (SEQ ID NO:8)) as primers. For the nested PCR,
we used Abridged Universal Amplification Primer (GibcoBRL) and JL3 (5'-
GATCTAGAGACTTAGAGGTCACTGC-3' (SEQ ID NO:9)) as primers, and
performed 35 cycles of PCR. The PCR products were subcloned and sequenced.
[183] Reporter gene constructs. To generate reporter plasmids, fragments of the 5'
region of the human ngn3 gene obtained by restriction digestion were ligated upstream
of the luciferase gene in the plasmid pFOXLuc1 or upstream of the TK minimal
promoter gene in the plasmid pFOXLuc1TK (Mirmira et al. J Biol Chem 275:14743-
51, 2000).

[184] Cell culture and transient transfections. βTC3 cells, αTC1.6 cells, and mPAC cells
were grown in Dulbecco's Modified Eagle Medium (DMEM) supplemented with 2.5%
fetal bovine serum and 15% horse serum. NIH3T3 cells were grown in DMEM medium
supplemented with 10% calf serum. Cos7 cells were grown in DMEM medium with
10% fetal bovine serum and 4 mM glutamine. For transient mammalian cell
transfections, cells were plated in six-well tissue culture plates 24 h before transfection.
For the standard reporter gene analysis, 2 µg of luciferase reporter plasmids were
transfected into the cells using TRANSFAST lipid reagent (Promega) according to the
manufacturer's instructions. For assessing the effect of the expression of HES1 on the
ngn3 promoter, we cotransfected the indicated amount of HES1 or dominant negative
Hes1 expression plasmid DNA (pcDNA3Hes1, kindly provided by R. Kageyama, Kyoto
University (Sasai et al. Genes Develop. 6:2620-2634, 1992)) with 2 µg of luciferase
reporter plasmids. Forty-eight hours after transfection, cells were harvested and
luciferase assays were performed as described previously (German et al. Genes & Dev.
6:2165-2176, 1992). Luciferase activity was corrected for cellular protein concentration.



All reporter gene analyses were performed on at least three occasions and data are 
expressed as mean ± SEM. 
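
The normalization and summary statistics described above can be sketched as follows. This is a minimal illustration, not the analysis actually used: it assumes that "corrected for cellular protein concentration" means dividing each raw luciferase reading by the protein amount of the same lysate, and that the SEM is the sample standard deviation (n-1 denominator) divided by the square root of n. The function names and the numbers in the demo are hypothetical.

```python
import math
import statistics


def normalize(luciferase_units, protein_amounts):
    """Divide each raw luciferase reading by the protein amount of its lysate."""
    if len(luciferase_units) != len(protein_amounts):
        raise ValueError("each luciferase reading needs a matching protein value")
    return [lu / p for lu, p in zip(luciferase_units, protein_amounts)]


def mean_sem(values):
    """Return (mean, standard error of the mean) for n >= 2 replicate values."""
    n = len(values)
    if n < 2:
        raise ValueError("SEM requires at least two replicates")
    mean = statistics.fmean(values)
    # Assumption: sample standard deviation (n - 1 in the denominator).
    sem = statistics.stdev(values) / math.sqrt(n)
    return mean, sem


if __name__ == "__main__":
    # Made-up readings from three independent transfections.
    raw_light_units = [100.0, 400.0, 900.0]
    protein_ug = [50.0, 100.0, 150.0]
    mean, sem = mean_sem(normalize(raw_light_units, protein_ug))
    print(f"normalized activity = {mean:.2f} ± {sem:.2f}")
```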

[185] Generation of transgenic mouse and detection of β-galactosidase. The plasmids
pNAT6B and pNAT3B were generated by ligating human ngn3 promoter fragments
extending from -5.7 kb to +261 bp and from -2.6 kb to +261 bp upstream of the human
β-globin intron and the bacterial β-galactosidase gene. Each plasmid was linearized and
microinjected (1.5 ng/µl) into murine pronuclei. The injected embryos were transferred
to pseudopregnant females and the fetal pancreata with stomach and small intestine were
harvested at E15.5 from the founder mice. Tissues were pre-fixed for 30 minutes at 4°C
in 4% paraformaldehyde, and then incubated overnight in X-gal (400 µg/ml) substrate at
37°C (-2.6 kb promoter) or room temperature (-5.7 kb promoter). Tissues were then
fixed again in 4% paraformaldehyde for 30 minutes, paraffin embedded, and sectioned at
5 µm. Genotype was determined by PCR using primers specific for the human ngn3
promoter sequence. β-galactosidase activity was assayed in 6 independent founder
fetuses that had integrated the -2.6 kb promoter construct, and in 8 independent founder
fetuses that had integrated the -5.7 kb promoter construct.

[186] Immunohistochemistry. Immunohistochemistry was performed on paraffin
embedded sections as described previously (Schwitzgebel et al. Development 127:3533-
3542, 2000). Primary antibodies were used at the following dilutions: guinea pig anti-
insulin (Linco), 1:5000; guinea pig anti-glucagon (Linco), 1:10000; rabbit anti-ngn3
(Schwitzgebel et al. Development 127:3533-3542, 2000), 1:5000. Biotinylated
secondary antibodies (Vector) were detected with the ABC Elite immunoperoxidase
system (Vector).

[187] Preparation of proteins and electrophoretic mobility shift assay (EMSA). HNF3β
and HNF1α proteins were produced in vitro using SP6 and T7 TNT Quick Coupled
Lysate System® (Promega) using pGEM-1ratHNF3β (generous gift from R. Costa,
University of Illinois at Chicago) and pcDNA3-HNF1α (generous gift from M. Stoffel,
Rockefeller University) as templates. Glutathione S-transferase (GST) fused HES1
protein was produced in Escherichia coli BL21 competent cells using the pGEX2T
plasmid system (Promega). Nuclear extracts from αTC1.6 cells, βTC3 cells, and
NIH3T3 cells were prepared following the procedure described by Sadowski and Gilman
(Sadowski and Gilman. Nature 362:79-83, 1993).

[188] Single-stranded oligonucleotides corresponding to the sequences in the human
ngn3 promoter were 5' end-labeled with [γ-32P]-ATP using T4 polynucleotide kinase.
The labeled oligonucleotides were column-purified and annealed to an excess of the
complementary strand. For HNF3β and HNF1α binding experiments, EMSA buffers
and electrophoresis conditions were as previously described (Mirmira et al. J Biol Chem
275:14743-51, 2000). For HES-1 binding experiments, conditions were the same except
that the poly(dI-dC) concentration was decreased to 15 ng/µl. One µl of the in vitro
reaction mixture, 2 µg of nuclear extract, or 400 ng of GST-fused Hes1 protein was
used for each binding reaction. When using antibodies, 1 µl of each antibody was
incubated with the binding mix for 15 min at room temperature prior to gel
electrophoresis. The antisera against HNF-3α, -3β, and -3γ were a generous gift from R.
Costa (University of Illinois) and the HES-1 antiserum was a generous gift from Y. Jan
(University of California San Francisco). The anti-HNF-1α antiserum was purchased
from Santa Cruz Biotechnology, Inc.

[189] The following oligonucleotides were used as labeled probes or competitors in
EMSA reactions (top strands shown):

H3-1: GATCTCTCGAGAGAGCAAACAGCGCGGCGG (SEQ ID NO:10)
H3-2: TTATTATTATTTTAGCAAACACTGGAGACAG (SEQ ID NO:11)
HI: ATCTCTTGTAATTATTTATTAAACGAAATCTATT (SEQ ID NO:12)
H2: TTAAACGAAATCTATTTATTATTATTTTAGCAAA (SEQ ID NO:13)
HIP: GATCTCGCCACGAGCCACAAGGATTG (SEQ ID NO:14)
E1: GATCTAAATTTCCCCATGTGTAACGTGCAG (SEQ ID NO:15)
N1: GATCTGGAGCGGGCTCGCGTGGCGCGGCCCCG (SEQ ID NO:16)
N2: GATCTGCCGGGCAGGCACGCTCCTGGCCCGG (SEQ ID NO:17)
N3/4: GATCTAAAGCGTGCCAAGGGGCACACGACTG (SEQ ID NO:18)
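
Because only the top strands are listed, the strand each probe would be annealed to can be derived computationally. The Python sketch below does this under the assumption that the bottom strand is the exact reverse complement of the listed top strand; the oligonucleotides actually used may have carried overhangs, which the text does not specify. The function name `bottom_strand` is illustrative.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def bottom_strand(top_strand: str) -> str:
    """Return the reverse complement (written 5'->3') of a top-strand sequence."""
    seq = top_strand.upper().replace(" ", "")
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    # Complement every base, then reverse so the result also reads 5'->3'.
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    e1_top = "GATCTAAATTTCCCCATGTGTAACGTGCAG"  # E1, SEQ ID NO:15
    print("E1 top:   ", e1_top)
    print("E1 bottom:", bottom_strand(e1_top))
```
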
Mapping the human ngn3 promoter 
[190] As an initial step in understanding the regulation of neurogenin3 gene
expression, the sequences of the mouse and human neurogenin3 promoters were
determined (Figs. 11A-11B).
Using RNA purified from embryonic day 15.5 fetal mouse pancreas, the transcription
start site of the murine ngn3 gene was determined by 5' Rapid Amplification of cDNA
Ends (5' RACE). All 5' RACE products identify the same start site, 30 base pairs
downstream from a putative TATAA box (Fig. 11B). The region upstream of the start
site is highly conserved in mouse, human and rat, with the region of highest homology in
mouse and human extending approximately 300 bp upstream. A CCAAT sequence
element lies at -85 bp relative to the transcription start site. Several other potential
sequence elements are identified in Fig. 11B.
Activity of the promoter in cell lines 

A series of progressive 5' deletions of the neurogenin3 promoter, each extending
to +261 bp on the 3' end, were linked to the firefly luciferase gene and were tested in cell
lines (Fig. 12). Serial deletions down to -502 bp do not diminish the promoter activity 
in vitro. Surprisingly, the promoter drives transcription at a high level in all the tested 
cell lines, including the fibroblast cell lines. This high non-specific activity appears to 
reside in the proximal promoter, since the shortest construct is still very active in all the 
examined cell lines. 

Activity of the promoter in transgenic mice 

While transient transfections in cell lines may provide some indication of
promoter activity, these tumor cells are not representative of the cells in the developing
pancreas where neurogenin3 is normally expressed. Therefore, mice were produced
carrying a transgene with either 5.7 kb or 2.6 kb of the upstream sequence from the
human neurogenin3 gene driving the bacterial gene encoding β-galactosidase. Founder
mice were harvested at embryonic day 15.5, at the normal peak of neurogenin3
expression in the fetal mouse pancreas (Schwitzgebel et al. Development 127:3533-
3542, 2000).

Animals carrying the 5.7 kb construct strongly and selectively express β-
galactosidase in central regions of the developing pancreas and in the gut epithelium, the
same regions where neurogenin3 is normally expressed at this time during development
(Fig. 13A). Although the level of expression is significantly lower with the 2.6 kb
construct, the overall pattern of β-galactosidase expression is the same (Figs. 13B, 13C
and data not shown).

[194] Immunohistochemistry was used to identify the cells expressing β-galactosidase
in the transgenic mice carrying the 5.7 kb promoter construct (Figs. 14A and 14B). The
β-galactosidase expressing cells are predominantly localized to the ducts. Most of the β-
galactosidase expressing cells do not express islet hormones, although occasional β-cells
co-express insulin and β-galactosidase.

[195] Despite the close co-localization of β-galactosidase activity and neurogenin3
protein expression specifically in the same regions of the developing pancreas and gut,
there is not a perfect match. Some of the β-galactosidase positive cells co-express high
levels of neurogenin3, but many do not, and many neurogenin3-expressing cells contain
little or no β-galactosidase activity. Most likely this discrepancy derives from
differences in the timing of accumulation and degradation of the two gene products,
rather than a difference in the onset and extinction of gene expression.

[196] The exact timing for initial detection of each gene product in a particular cell
depends on its rate of accumulation and threshold for detection, and therefore should not
be expected to be identical. In addition, some neurogenin3-expressing cells may
randomly silence the transgene, a poorly understood phenomenon observed with many
promoters in transgenic mice (Graubert et al. Nucleic Acids Res 26:2849-58, 1998). The
very brief but abundant expression of neurogenin3 in progenitor cells indicates that the
mRNA and protein accumulate rapidly but have very short half-lives. β-galactosidase, in
contrast, has a fairly long half-life in mammalian cells (Smith et al. J Virol 69:4593-9,
1995) and could be expected to peak later and persist in cells after neurogenin3 is no
longer detectable. Therefore many of the β-galactosidase expressing cells apparently
represent a stage of islet cell differentiation that occurs after neurogenin3 gene
production has ceased, but before hormone expression has started. The large number of
these cells suggests that this intermediate stage of differentiation may last longer than the
initial neurogenin3-expressing stage.



Intestinal expression 

Starting at E15.5, endogenous β-galactosidase expression can be detected at low
levels along the brush border of the intestinal villi in both transgenic and non-transgenic
embryos. Stronger β-galactosidase activity can also be detected in a speckled pattern
that is most prominent in the small intestine of the transgenic mice but is absent in their
non-transgenic littermates (Fig. 13A). Sectioning of the gut reveals that this β-
galactosidase signal derives from scattered cells within the intestinal epithelium (Figs.
15A-15C). This pattern of β-galactosidase expression suggests that the neurogenin3
promoter is also active in a subset of progenitor cells in the developing gut. These may
be progenitors for gut endocrine cells. As in the pancreas, this β-galactosidase activity
partially overlaps endogenous neurogenin3 expression, again suggesting that the peak of
β-galactosidase accumulation is delayed relative to neurogenin3 (Fig. 15C).

Multiple factors bind to the neurogenin3 promoter

To identify nuclear factors that bind to the ngn3 promoter, a series of 
oligonucleotides were synthesized. The oligonucleotides, which spanned potentially 
important DNA binding sites within the promoter, were tested for binding to nuclear 
proteins by electromobility shift assay (EMSA). 

Members of the HNF3 family of winged helix transcription factors have been
implicated in pancreatic development and islet function (Wu et al. Mol Cell Biol
17:6002-13, 1997; Gerrish et al. J Biol Chem 275:3485-92, 2000; Sharma et al. Mol
Cell Biol 17:2598-604, 1997; Duncan et al. Science 281:692-5, 1998; Philippe et al.
Mol Cell Biol 14:3514-23, 1994; Kaestner et al. Genes Dev 13:495-504, 1999). Based
on their similarity to a consensus HNF3 binding site (Costa et al. Mol Cell Biol 9:1415-
25, 1989), there are several potential HNF3 binding sites within the 5.7 kb human
neurogenin3 promoter. Two of the most promising sites lie at -3687 bp and at -200 bp.
EMSA testing of these binding sites showed that both sites bind with high affinity to in
vitro produced HNF3β (Fig. 16). Using extracts from βTC3 and αTC1.6 cells, a single
major complex binds to both sites, and is recognized specifically by an antiserum to
HNF3β. In addition, co-expression of HNF3β can activate the neurogenin3 promoter in
transiently transfected 3T3 fibroblast cells (data not shown).
[200] The -3687 bp HNF3β binding site forms part of a cluster of potential DNA
binding sites for known pancreatic transcription factors (Fig. 11), including potential
sites for hox type homeodomain transcription factors, as well as cut-homeodomain
transcription factor HNF6 and the Pou-homeodomain HNF1 factors (Courtois et al.
Science 238:688-92, 1987). HNF6 binding to the neurogenin3 promoter has been
demonstrated previously (Jacquemin et al. Mol Cell Biol 20:4445-54, 2000).
[201] An oligonucleotide spanning the potential HNF1 binding site was tested by
EMSA and found to bind to in vitro produced HNF-1α. In addition, in nuclear
extracts from βTC3 cells, a major low mobility complex binds to the oligonucleotide and
is recognized specifically by antiserum to HNF1α (Fig. 17). The similar hox
homeodomain-type binding site immediately downstream of the HNF1α binding site
does not bind HNF1α (data not shown).

HES-1 inhibition of the neurogenin3 promoter
[202] It has been proposed that Notch receptor signaling through the transcriptional
regulator HES-1 may prevent the expression of neurogenin3 in all but a small subset of
the cells in the developing pancreas (Jensen et al. Nat Genet 24:36-44, 2000). To test the
ability of HES-1 to directly inhibit the neurogenin3 promoter, the HES-1 cDNA was
expressed from a CMV promoter-driven expression plasmid in 3T3 cells along with the
neurogenin3 promoter luciferase plasmid (Figs. 18A-18C). HES-1 dramatically and
specifically inhibits the neurogenin3 promoter. Removal of 5' sequences down to -502
bp does not significantly reduce the ability of HES-1 to inhibit the promoter.

[203] To further map sequences competent to respond to HES-1 repression, plasmids 

were constructed with either the human ngn3 gene promoter sequence from -208 bp to 
+40 bp (proximal promoter (Fig. 19A)) linked to the firefly luciferase gene, or the 
sequences from -2.6 kb to -208 bp (distal promoter) upstream of the herpes virus 
Thymidine Kinase (TK) promoter linked to the firefly luciferase gene. The small 
proximal promoter retains most of the capacity for HES-1 repression, while the distal 
sequences are repressed weakly by HES-1.

[204] Within the proximal 208 bp of the promoter, there are several potential HES-1
binding sites based on the consensus binding sites for HES-1 (CTNGTG) (Takebayashi
et al. J Biol Chem 269:5150-6, 1994) and its Drosophila homologs hairy/enhancer-of-
split (CGCGTC) (Van Doren et al. Genes Dev 8:2729-42, 1994; Ohsako et al. Genes
Dev 8:2743-55, 1994) (Fig. 19A). Three oligonucleotides containing four of these sites
were tested for binding to bacterially produced HES-1 protein by gel mobility shift assay
(Fig. 19B). All three oligonucleotides bind HES-1, and do so with greater affinity than
the previously described high affinity tandem sites from the mouse HES-1 gene
(Takebayashi et al. J Biol Chem 269:5150-6, 1994) (labeled HI in Fig. 19C). All four of
these sequences are conserved in the mouse neurogenin3 promoter (Fig. 11).
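
A consensus-based search of this kind can be sketched in Python as follows. This is a minimal illustration and not the method used to identify the sites in Fig. 19A: it reports only exact matches to a consensus on either strand, treating N as any base, whereas potential sites may deviate from the consensus and so need not be recovered by an exact scan. The function names `revcomp` and `find_sites` are hypothetical.

```python
import re

# Regex fragment for each symbol allowed in a consensus (N = any base).
_SYMBOLS = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]"}
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def revcomp(seq: str) -> str:
    """Reverse complement of a sequence that may contain N."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_sites(seq: str, consensus: str):
    """Return sorted (start, strand) pairs for exact consensus matches.

    start is the 0-based position on the given (top) strand; strand is '+'
    when the consensus itself matches and '-' when its reverse complement
    matches, i.e. the site lies on the bottom strand.
    """
    seq = seq.upper()
    hits = set()
    for strand, motif in (("+", consensus.upper()), ("-", revcomp(consensus))):
        # A lookahead lets overlapping matches be reported.
        pattern = re.compile("(?=" + "".join(_SYMBOLS[b] for b in motif) + ")")
        for match in pattern.finditer(seq):
            hits.add((match.start(), strand))
    return sorted(hits)


if __name__ == "__main__":
    # Oligonucleotide listed as SEQ ID NO:14 in the Materials and Methods.
    seq_id_14 = "GATCTCGCCACGAGCCACAAGGATTG"
    for start, strand in find_sites(seq_id_14, "CTNGTG"):
        print(f"CTNGTG match at position {start} on the {strand} strand")
```

Scanning both strands matters here because a site written as CTNGTG on one strand reads as CACNAG on the other.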

The invention now being fully described, it will be apparent to one of ordinary 
skill in the art that many changes and modifications can be made thereto without 
departing from the spirit or scope of the appended claims. 



